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<g) Procaryotlc carbonyl hydrolases, methoda. DMA, vectors and transformed hosts for producing them, and detergent 

compositions containing them. 

@ Methods and vectors are provided for the production of 

procaryotic carbonyl hydrolases in recombinant systems DN A 

which encodes such hydrolases are mutated at predetermined 

regions by known methods or by a novel point mutagenesis 

method in order to generate mutant hydrolases. Pan»cuiaf 

point mutations in carbonyl hydrolases such as subtdtsm resuu 

in modifications of oxidation stability. Km. Kcat. Kcat Km ratio 

substrate specificity, specific activity or pH-activrry profiles 

These mutated hydrolases are particularly useful m laundry 
•s™ compositions. Mutations in thegenes encoding the sufctitisino? 

neutral protease of bacillus yield substantially normally sporu 
" lating bacillus strains which are incapable of excreting subM 
^ isin or neutral protease . Such strains are useful m the » ecomtx 
™ nsnt synthesis of heterologous proteins. 
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10 PROCARYOTIC CARBONYL -HYDROLASES , -METHODS, DNA, 

VECTORS AND TRANSFORMED HOSTS FOR PRODUCING 
THEM, . AND DETERGENT COMPOSITIONS CONTAINING THEM 
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20 Background 

This Invention relates to the production and manipulation of 
proteins using recombinant techniques in suitable hosts. More 
specifically, the invention relates to the production of procaryotic 
25 proteases such as subtilisin and neutral protease using recombinant 
microbial host cells, to the synthesis of heterologous proteins by 
microbial hosts, and to the directed mutagenesis of enzymes in order 
to modify the characteristics thereof. 

30 Various bacteria are known to secrete proteases at some stage in 

their life cycles. Bacillus species produce two major extracellular 
proteases, a neutral protease (a metal loprotease inhibited by EDTA) 
and an alkaline protease (or subtilisin, a serine endoprotease) . 
Both generally are produced in greatest quantity after the 

35 exponential growth phase, when the culture enters stationary phase 
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and begins the process of sporulation. The physiological role of 
these two proteases is not clear. They have been postulated to play 
a role in sporulation (J. Hoch, 1976, "Adv. Genet." 18:69-98; 
P. Piggot et al_., 1976, "Bact. Rev." 40:908-962; and F. Priest, 
5 1977, "Bact. Rev." 4^:711-753) , to be involved in the regulation of 
cell wall turnover (I. Jolliffe et aK, 1980, "J. Bact." 
141:1199-1208), and to be scavenger enzymes (Priest, Id.). The 
regulation of expression of the protease genes is complex. They 
appear to be coordinately regulated in concert with sporulation, 
10 since mutants blocked in the early stages of sporulation exhibit 
reduced -levels of both the alkaline and neutral protease. 
Additionally, a number of pleiotropic mutations exist which affect 
the level of expression of proteases and other secreted gene 
products, such as amylase and levansucrase (Priest, Id.). 

15 

Subtil isin has found considerable utility in industrial and 
commercial applications (see U.S. Patent No. 3,623,957 and 
J. Millet, 1970, "J. Appl . Bact." 33:207). For example, subtilisins 
and other proteases are commonly used in detergents to enable 
20 removal of protein-based stains. They also are used in food 

processing to accommodate the proteinaceous substances present in 
the food preparations to their desired impact on the composition. 

Classical mutagenesis of bacteria with agents such as radiation 
25 or chemicals has produced a plethora of mutant strains exhibiting 
different properties with respect to the growth phase at which 
protease excretion occurs as well as the timing and activity levels 
of excreted protease. These strains, however, do not approach the 
ultimate potential of the organisms because the mutagenic process is 
30 essentially random, with tedious selection and screening required to 
identify organisms which even approach the desired characteristics. 
Further, these mutants are capable of reversion to the parent or 
wild-type strain. In such event the desirable property is lost. 
The probability of reversion is unknown when dealing with random 
35 mutagenesis since the type and site of mutation is unknown or poorly 
0992Y 
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characterized. This introduces considerable uncertainty into the 
industrial process which is based on the enzyme-synthesizing 
bacterium. Finally, classical mutagenesis frequently couples a 
desirable phenotype, e.g. , low protease levels, with an undesirable 
5 character such as excessive premature cell lysis. 

Special problems exist with respect to the proteases which are 
excreted by Bacillus. For one thing, since at least two such 
proteases exist, screening for the loss of only one is difficult. 
10 Additionally, the large number of pleiotropic mutations affecting 
both sporulation and protease production make the isolation of true 
protease mutations difficult. 

Temperature sensitive mutants of the neutral protease gene have 
15 been obtained by conventional mutagenic techniques, and were used to 
map the position of the regulatory and structural gene in the 
Bacillus subtil is chromosome (H. Uehara et aK, 1979, M. Bact. H 
139:533-590), Additionally, a presumed nonsense mutation of the 
alkaline protease gene has been reported (C. Roitsch et aK , 1983, 
20 "J. Bact." 155:145-152). 

Bacillus temperature sensitive mutants have been isolated vhat 
produce inactive serine protease or greatly reduced levels of serine 
protease. These mutants, however, are asporogenous and show a 

25 reversion frequency to the wild- type of about from 10" 7 to 1(T 8 
(F. Priest, Id. p. 719). These mutants are unsatisfactory for the 
recombinant production of heterologous proteins because asporogenous 
mutants tend to lyse during earlier stages of their growth cycle in 
minimal medium than do sporogenic mutants, thereby prematurely 

30 releasing cellular contents (including intracellular proteases) into 
the culture supernatant. The possibility of reversion also is 
undesirable since wild-type revertants will contaminate the culture 
supernatant with excreted proteases. 
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Bacillus sp. have been proposed for the expression of 
heterologous proteins, but the presence of excreted proteases and 
the potentiai resulting hydrolysis of the desired product has 
retarded the conFi?»-cial acceptance of Bacillus as a host for the 
5 expression of heterologous proteins. Bacillus mega teri urn mutants 
have been disclosed that are capable, of sporulation and which do not 
express a sporulation-associated protease during growth phases. 
However, the assay employed did not exclude the presence of other 
proteases, and the protease in question is expressed during the 

10 sporulation phase (C. Loshon et al_. , 1982, "J. Bact." 150:303-311). 
This, of course, is the point at which heterologous protein would 
have accumulated in the culture and be vulnerable. It is an 
objective herein to construct a Bacillus strain that is 
substantially free of extracellular neutral and alkaline protease 

15 during all phases of its growth cycle and which exhibits 

substantially normal sporulation characteristics. A need exists for 
non-revertible, otherwise normal protease deficient organisms that 
can then be transformed with high copy number plasmids for the 
expression of heterologous or homologous proteins. 

20 

Enzymes having characteristics which vary from available stock 
are required. In particular, enzymes having enhanced oxidation 
stability will be useful in extending the shelf life a;id bleach 
compatibility of proteases used in laundry products. Similarly, 
25 reduced oxidation stability would be useful in industrial processes 
that require the rapid and efficient quenching of enzymatic activity. 

Modifying the pH-activity profiles of an enzyme would be useful 
in making the enzymes more efficient in a wide variety of processes, 
e.g. broadening the pH-activity profile of a protease would produce 
an enzyme more suitable for both alkaline and neutral laundry 
products. Narrowing the profile, particularly when combined with 
tailored substrate specificity, would make enzymes in a mixture more 
compatible, as will be further described herein. 
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Mutations of procaryotic carbonyl hydrolases (principally 
proteases but including lipases) will facilitate preparation of a 
variety of different hydrolases, particularly those having other 
modified properties such as Km, Kcat, Km/Kcat ratio and substrate 
5 specificity. These enzymes can then be tailored for the particular 
substrate which is anticipated to be present, for example in the 
preparation of peptides or for hydrolytic processes such as laundry 
uses. 

10 Chemical modification of enzymes is known. For example, see I. 

Svendserr, 1976, "Carlsberg Res. Commun." 41_ (5): 237-291. These 
methods, however, suffer from the disadvantages of being dependent 
upon the presence of convenient amino acid residues, are frequently 
nonspecific in that they modify all accessible residues with common 

15 side chains, and are not capable of reaching inaccessible amino acid 
residues without further processing, e.g. denaturation, that is 
generally not completely reversible in reinstituting activity. To 
the extent that such methods have the objective of replacing one 
amino acid residue side chain for another side chain or equivalent 

20 functionality, then mutagenesis promises to supplant such methods. 

Predetermined, site-directed mutagenesis of tRNA synthetase in 
which a cys residue is converted to serine has been reported 
(G. Winter etaU, 1982, "Nature" 299:756-758; A. Wilkinson et aj. , 
25 1984, "Nature" 307:187-188). This method is not practical for large 
scale mutagenesis. It is an object herein to provide a convenient 
and rapid method for mutating DNA by saturation mutagenesis. 

30 Summary 

A method for producing procaryotic carbonyl hydrolase such as 
subtil i sin and neutral protease in recombinant host cells is 
described in which expression vectors containing sequences which 
35 encode desired subtil isin or neutral protease, including the pro, 
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pre, or prepro forms of these enzymes, are used to transform hosts, 
the host cultured and desired enzymes recovered. The coding 
sequence may correspond exactly to one found in nature, or may 
contain modifications which confer desirable properties on the 
5 protein that is produced, as is further described below. 

The novel strains then are transformed with at least one DNA 
moiety encoding a polypeptide not otherwise expressed in the host 
strain, the transformed strains cultured and the polypeptide 

10 recovered from the culture. Ordinarily, the DNA moiety is a 
directed- mutant of a host Bacillus gene, although it may be DNA 
encoding a eucaryotic (yeast or mammalian) protein. The novel 
straiins also serve as hosts for protein expressed from a bacterial 
gene derived from sources other than the host genome, or for vectors 

15 expressing these heterologous genes, or homologous genes from the 
host genome. In the latter event enzymes such as amylase are 
obtained free of neutral protease or subtilisin. In addition, it is 
now possible to obtain neutral protease in culture which is free of 
enzymatically active subtilisin, and vice-versa. 

20 

One may, by splicing the cloned genes for procaryotic carbonyl 
hydrolase into a high copy number plasmid, synthesize the enzymes in 
enhanced yield compared to the parental organisms. Also disclosed 
are modified forms of such hydrolases, including the pro and prepro 
25 zymogen forms of the enzymes, the pre forms, and directed mutations 
thereof. 

A convenient method is provided for saturation mutagenesis, 
thereby enabling the rapid and efficient generation of a plurality 
3 q of mutations at any one site within the coding region of a protein, 
comprising; 

(a) obtaining a DNA moiety encoding at least a portion of 
said precursor protein; 

35 
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(b) identifying a region within the moiety; 

(c) substituting nucleotides for those already existing 
within the region in order to create at least one 

5 restriction enzyme site unique to the moiety, whereby unique 

restriction sites 5' and 3' to the identified region are 
made available such that neither alters the amino acids 
coded for by the region as expressed; 

10 (d) synthesizing a plurality of oligonucleotides, the 5' and 

3' ends of which each contain sequences capable of annealing 
to the restriction en?yme sites introduced in step (c) and 
which, when ligated to the moiety, are expressed as 
substitutions, deletions and/or insertions of at least one 

15 amino acid in or into said precursor protein; 

(e) digesting the moiety of step (c) with restriction 
enzymes capable of cleaving the unique sites; and 

20 (f) ligating each of the oligonucleotides of step (d) into 

the digested moiety of step (e) whereby a plurality of 
mutant DMA moieties are obtained. 

By the foregoing method or others known in the art, a mutation 
25 is introduced into isolated ONA encoding a procaryotic carbonyl 
hydrolase which, upon expression of the DNA. results in the 
substitution, deletion or insertion of at least one amino acid at a 
predetermined site in the hydrolase. This method is useful in 
creating mutants of wild type proteins (where the "precursor" 
30 protein is the wild type) or reverting mutants to the wild type 
(where the "precursor 1 * is the mutant* 

Mutant enaymes are recovered which exhibit oxidative stability 
and/or pH-activity profiles which differ from the precursor 
35 enzymes. Procaryotic carbon>l hydrolases having varied Km, Kcat, 
Kcat/Km ratio and substrate specificity also are provided herein. 

0992Y 
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The mutant enzymes obtained by the methods herein are combined 
in known fashion with surfactants or detergents to produce novel 
compositions useful in the laundry or other cleaning arts. 

5 

Brief Description of the Drawings 

Figure 1 shows the sequence of a functional amyloliquefaciens 
subtil i sin gene. 

10 

In Figure 1A, the entire functional sequence for 13. 
amy! ol i quef aci ens , including the promoter and ribosome binding site, 
are present on a 1.5 kb fragment of the amylol iquefaciens genome. 

15 Figure IB shows the nucleotide sequence of the coding strand, 

correlated with the amino acid sequence of the protein. Promoter 
(p) ribosome binding site (rbs) and termination (term) regions of 
the DMA sequence are also shown. 

20 Figure 2 shows the results of replica nitrocellulose filters of 
purified positive clones probed with Pool 1 (Panel A) and Pool 2 
(Panel B) respectively. 

Figure 3 shows the restriction analysis of the subtil isin 
25 expression plasmid (pS4). pBS42 vector sequences (4.5 kb) are shown 
in solid while the insert sequence (4.4 kb) is shown dashed. 

Figure 4 shows the results of SDS-PAGE performed on supernatants 
from cultures transformed with pBS42 and pS4. 

30 

Figure 5 shows the construction of the shuttle vector pBS42. 

Figure 6 shows a restriction map for a sequence including the B. 
subtil is subtil isin gene. 

35 
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Figure 7 is the sequence of a functional B. subtil is subtil isin 
gene. 

Figure 8 demonstrates a construction method for obtaining a 
deletion mutant of a B. subtil is subtil isin gene. 

Figure 9 discloses the restriction map for a li. subtil is neutral 
protease gene. 

Figure 10 is the nucleotide sequence for a subtil is neutral 
protease gene. 

Figure 11 demonstrates the construction of a vector containing a 
JL- subtil is neutral protease gene. 

Figures 12, 13 and 16 disclose embodiments of the mutagenesis 
technique provided herein. 

Figure 14 shows the enhanced oxidation stability of a subtilisin 
mutant. 

Figure 15 demonstrates a change in the pH-activity profile of a 
subtilisin mutant when compared to the wild type enzyme. 

Detailed Description 

Procaryotic carbonyl hydrolases are enzymes which hydrolyze 

0 
ii 

compounds containing C-X bonds in which X is oxygen or nitrogen. 
They principally include hydrolases, e.g. lipases and peptide 
hydrolases, e.g. subtilisins or metal loproteases. Peptide 
hydrolases include a-aminoacyl peptide hydrolase, peptidylamino-acid 
hydrolase, acyl amino hydrolase, serine carboxypeptidase, 
metal locarboxypeptidase, thiol proteinase, carboxyl proteinase and 

0992Y 
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metalloproteinase. Serine, metallo, thiol and acid proteases are 
included, as well as endo and exo-proteases. 

Subtil isins are serine proteinases which generally act to cleave 
internal peptide bonds of proteins or peptides, Metalloproteases 
are exo- or endoproteases which require a metal ion cofactor for 
activity. 

A number of naturally occurring mutants of subtil isin or neutral 
protease exist, and all may be employed with equal effect herein as 
sources /or starting genetic material. 

These enzymes and their genes may be obtained from many 
procaryotic organisms. Suitable examples include gram negative 
organisms such as E. coli or pseudomonas and gram positive bacteria 
such as micrococcus or bacillus. 

The genes encoding the carbonyl hydrolase may be obtained in 
accord with the general method herein. As will be seen from the 
examples, this comprises synthesizing labelled probes having 
putative sequences encoding regions of the hydrolase of interest, 
preparing genomic libraries from organising expressing the 
hydrolase, and screening the libraries for the gene of interest by 
hybridization to the probes. Positively hybridizing clones are then 
mapped and sequenced. The cloned genes are ligated into an 
expression vector (which also may be the cloning vector) with 
requisite regions for replication in the host, the plasmid 
transfected into a host for enzyme synthesis and the recombinant 
host cells cultured under conditions favoring enzyme synthesis, 
usually selection pressure such as is supplied by the presence of an 
antibiotic, the resistance to which Is encoded by the vector. 
Culture under these conditions results in enzyme yields multifold 
greater than the wild type enzyme synthesis of the parent organism, 
even if it is the parent organism that is transformed. 
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"Expression vector" refers to a DNA construct containing a DNA 
sequence which is operably linked to a suitable control sequence 
capable of effecting the expression of said DNA in a suitable host. 
Such control sequences include a promoter to effect transcription, 
5 an optional operator sequence to control such transcription, a 
sequence encoding suitable mRNA ribosome binding sites, and 
sequences which control termination of transcription and 
translation. The vector may be a plasmid, a phage particle, or 
simply a potential genomic insert. Once transformed into a suitable 

10 host, the vector may replicate and function independently of the 
host genome, or may, in some instances, integrate into .the genome 
itself. In the present specification, "plasmid" and "vector" are 
sometimes used interchangeably as the plasmid is the most commonly 
used form of vector at present. However, the invention 

1 5 is intended to include such other foras of expression vectors which 
serve equivalent functions and which are, or become, known in the 
art. 

"Recombinant host cells" refers to cells which have been 
20 transformed or transfected with vectors constructed using recombinant 
DNA techniques. As relevant to the present invention, recombinant 
host cells are those which produce procaryotic carbonyl hydrolases 
in its various forms by virtue of having been transformed with 
expression vectors encoding the r nroteins. The recombinant host 
25 cells may or may not have produce^ a form of carbonyl hydrolase 
prior to transformation. 

"Operably linked" when describing the relationship between two 
DNA regions simply means that they are functionally related to each 

30 other. For example, a presequence is operably linked to a peptide 
if it functions as a signal sequence, participating in the secretion 
of the mature form of the protein most probably involving cleavage 
of the signal sequence. A promoter is operably linked to a coding 
sequence if it controls the transcription of the sequence; a 

35 ribosome binding site is operably linked to a coding sequence if it 
is positioned so as to permit translation. 
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"Prohydrolase" refers to a hydrolase which contains additional 
N-terminal amino acid residues which render the enzyme inactive but, 
when removed, yield an enzyme. Many proteolytic enzymes are found 
in nature as translational proenzyme products and, in the absence of 
5 post-translational products, are expressed in this fashion. 

"Presequence" refers to a signal sequence of amino acids bound 
to the N-terminal portion of the hydrolase which may participate in 
the secretion of the hydrolase. Presequences also may be modified 

10 in the same fashion as is described here, including the introduction 
of predetermined mutations. When bound to a hydrolase, the subject 
protein becomes a "prehydrolase". Accordingly, relevant 
prehydrolase for the purposes herein are presubtil isin and 
preprosubtilisin. Prehydrolases are produced by deleting the "pro" 

15 sequence (or at least that portion of the pro sequence that 

maintains the enzyme in its inactive state) from a prepro coding 
region, and then expressing the prehydrolase. In this way the 
organism excretes the active rather than proenzyme. 

20 Th e cloned carbonyl hydrolase is used to transform a host cell 

in order to express the hydrolase. This will be of interest where 
the hydrolase has commercial use in its unmodified form, as for 
example subtil isin in laundry products as noted above. In the 
preferred embodiment the hydrolase gene is ligated into a high copy 

25 number plasmid. This plasmid replicates in hosts in the sense that 
it contains the well-known elements necessary for plasmid 
replication: a promoter operably linked to the gene in question 
{which may be supplied as the gene's own homologous promotor if it 
is recognized, i.e., transcribed, by the host), a transcription 

30 termination and polyadenylation region (necessary for stability of 
the mRNA transcribed by the host from the hydrolase gene) which is 
exogenous or is supplied by the endogenous terminator region of the 
hydrolase gene and, desirably, a selection gene such as an 
antibiotic resistance gene that enables continuous cultural 

35 maintenance of plasmf d-infected host cells by growth in 
0992Y 



-13- 



0130756 



antibiotic-containing media. High copy number plasmids also contain 
an origin of replication for the host, thereby enabling large 
numbers of plasmids to be generated in the cytoplasm without 
chromosonal limitations. However, it is within the scope herein to 
5 integrate multiple copies of the hydrolase gene into host genome. 
This is facilitated by oacterial strains which are particularly 
susceptible to homologous recombination. The resulting host cells 
are termed recombinant host cells* 

10 Once the carbonyl hydrolase gene has been cloned, a number of 
modifications are undertaken to enhance the use of the gene beyond 
synthesis of the wild type or precursor enzyme. A precursor enzyme 
is the enzyme prior to its modification as described in this 
application. Usually the precursor is the enzyme as expressed by 

15 the organism which donated the DNA modified in accord herewith. The 
term "precursor" is to be understood as not implying that the 
product enzyme was the result of manipulation of the precursor 
enzyme per se» 

20 In the first of these modifications, the gene may be deleted 
from a recombination positive (rec + ) organism containing a 
homologous gene. This is accomplished by recombination of an in 
vvtro deletion mutation of the cloned gene with the genome of the 
organism. Many strains of organisms such as E» co1i and Bacillus are 

25 known to be capable of recombination. All that is needed is for 
regions of the residual DNA from the deletion mutant to recombine 
with homologous regions of the candidate host. The deletion may be 
within the coding region (leaving enzymatically inactive 
polypeptides) or include the entire coding region as long as 

30 homologous flanking regions (such as promoters or termination 
regions) exist in the host. Acceptability of the host for 
recombination deletion mutants is simply determined by screening for 
the deletion of the transformed phenotype. This is most readily 

accomplished in the case of carbonyl hydrolase by assaying host 
35 cultures for loss of the ability to cleave a chromogenic substrate 
otherwise hydrolyzed by the hydrolase. 
0992Y 
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Transformed hosts contained the protease deletion mutants are 
useful for synthesis of products which are incompatible with 
proteolytic enzymes. These hosts by definition are incapable of 
excreting the deleted proteases described herein, yet are 
5 substantially normally sporulating. Also the other growth 

characteristics of the transformants are substantially like the 
parental organism. Such organisms are useful in that it is expected 
they will exhibit comparatively less inactivation of heterologous 
proteins than the parents, and these hosts do have growth 
10 characteristics superior to known protease-deficient organisms. 
However,- the deletion of neutral protease and subtilisin as 
described in this application does not remove all of the proteolytic 
activity of Bacillus. It is believed that intracellular proteases 
which are not ordinarily excreted extracellularly "leak" or diffuse 
15 from the cells during late phases of the culture. These 

intracellular proteases may or may not be subtilisin or neutral 
protease as those enzymes are defined herein. Accordingly, the 
novel Bacillus strains herein are incapable of excreting the 
subtilisin and/or neutral protease enzymes which ordinarily are 
.20 excreted extracellularly in the parent strains. "Incapable" means 
net revertible to the wild type. Reversion is a finite probability 
that exists with the heretofore known protease-deficient, naturally 
occurring strains since there is no assurance that the phenotype of 
such strains is not a function of a readily revertible mutation, 
25 e.g. a point mutation. This to be contrasted with the extremely 
large deletions provided herein. 

The deletion mutant-transformed host cells herein are free of 
genes encoding enzymatically active neutral protease or subtilisin, 
30 which genes are defined as those being substantially homologous with 
the genes set forth in Figs. 1, 7 or 10. "Homologous" genes contain 
coding regions capable of hybridizing under high stringency 
conditions with the genes shown in Figs. 1, 7 or 10. 
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The microbial strains containing carbonyl hydrolase deletion 
mutants are useful in two principal processes. In one embodiment 
they are advantageous in the fermentative production of products 
ordinarily expressed by a host that are desirably uncontaminated 
5 with the protein encoded by the deletion gene. An example is 
fermentative synthesis of amylase, where contaminant proteases 
interfere in many industrial uses for amylase. The novel strains 
herein relieve the art from part of the burden of purifying such 
products free of contaminating carbonyl hydrolases. 

10 

In a- second principal embodiment, subtil isin and neutral 
protease deletion-mutant strains are useful in the synthesis of 
protein which is not otherwise encoded by the strain. These 
proteins will fall wjthin one of two classes. The first class 

15 consists cf proteins encoded by genes exhibiting no substantial 
pretrans formati on homology with those of the host. These may be 
proteins from other procaryotes but ordinarily are eucaryotic 
proteins from yeast or higher eucaryotic organisms, particularly 
mammals. The novel strains herein serve as useful hosts for 

20 expressible vectors containing genes encoding such proteins because 
the probability for proteolytic degradation of the expressed, 
non-homologous proteins is reduced. 

The second group consists of mutant host genes exhibiting 
25 substantial pretransfonnation homology with those of the host. 

These include mutations of procaryotic carbonyl hydrolases such as 
subtilisin and neutral protease, as well as microbial (rennin, for 
example rennin from the genus Mucor). These mutants are selected in 
order to improve the characteristics of the precursor enzyme for 
30 industrial uses. 

A novel method is provided to facilitate the construction and 
identification of such mutants. First, the gene encoding the 
hydrolase is obtained and sequenced in whole or in part. Then the 
35 sequence is scanned for a point at which it is desired to make a 
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mutation (deletion, insertion or substitution) of one or more amino 
acids in the expressed enzyme. The sequences flanking this point 
are evaluated for the presence of restriction sites for replacing a 
short segment of the gene with an oligonucleotide pool which when 
expressed will encode various mutants. Since unique restriction 
sites are generally not present at locations within a convenient 
distance from the selected point (from 10 to 15 nucleotides), such 
sites are generated by substituting nucleotides in the gene in such 
a fashion that neither the reading frame nor the amino acids encoded 
are changed in the final construction. The task of locating 
suitable flanking regions and evaluating the needed changes to 
arrive at two unique restriction site sequences is made routine by 
the redundancy of the genetic code, a restriction enzyme map of the 
gene and the large number of different restriction enzymes. Mote 
that if a fortuitous flanking unique restriction site is available, 
the above method need be used only in connection with the flanking 
region which does not contain a site. 

Mutation of the gene in order to change its sequence to conform 
to the desired sequence is accomplished by M13 primer extension in 
accord with generally known methods. Once the gene is cloned, it is 
digested with the unique restriction enzymes and a plurality of end 
termini-complementary oligonucleotide cassettes are ligated into the 
unique sites. The mutagenesis is enormously simplified by this 
method because all of the oligonucleotides can be synthesized so as 
to have the same restriction sites, and no synthetic linkers are 
necessary to create the restriction sites. 

The number of commercially available restriction enzymes having 
sites not present in the gene of interest is generally large. A 
suitable DNA sequence computer search program simplifies the task of 
finding potential 5* and 3" unique flanking sites. A primary 
constraint is that any mutation introduced in creation of the 
restriction site must be silent to the final constructed amino acid 
coding sequence. For a candidate restriction site 5' to the target 
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codon a sequence must exist in the gene which contains at least all 
the nucleotides but for one in the recognition sequence 5' to the 
cut of the candidate enzyme. For example, the blunt cutting enzyme 
Snal (CCC/GGG) would be a 5* candidate if a nearby 5' sequence 
contained NCC, CNC, or CCN. Furthermore, if N needed to be altered 
to C this alteration must leave the amino acid coding sequence 
intact. In cases where a permanent silent mutation is necessary to 
introduce a restriction site one may want to avoid the introduction 
of a rarely used codon. A similar situation for Smal would apply 
for 3' flanking sites except the sequence NGG, GMG, or GGN must 
exist. -The criteria for locating candidate enzymes is most relaxed 
for blunt cutting enzymes and most stringent for 4 base overhang 
enzymes. In general many candidate sites are available. For the 
codon-222 target described herein a Ball site (TGG/CCA) could have 
been engineered in one base pair 5' from the Kpnl site. A 3' EcoRV 
site (GAT/ATC) could have been employed 11 base pairs 5' to the PstI 
site. A cassette having termini ranging from a blunt end up to a 
four base-overhang will function without difficulty. In retrospect, 
this hypothetical EcoRV site would have significantly shortened the 
oligonucleotide cassette employed (9 and 13 base pairs), thus 
allowing greater purity and lower pool bias problems. Flanking 
sites should obviously be chosen which cannot themselves ligate so 
that ligation of the oligonucleotide cassette can be assured in a 
single orientation. 

The mutation p_er se need not be predetermined. For example, an 
oligonucleotide cassette or fragment is randomly mutagenized with 
nitrosoguanidine or other mutagen and then in turn ligated into the 
hydrolase gene at a predetermined location. 

The mutant carbonyl hydrolases expressed upon transformation of 
the suitable hosts are screened for enzymes exhibiting desired 
characteristics, e.g. substrate specificity, oxidation stability, 
pH-activity profiles and the like. 
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A change in substrate specificity is defined as a difference 
between the Kcat/Km ratio of the precursor enzyme and that of the 
mutant. The Kcat/Km ratio is a measure of catalytic efficiency. 
Procaryotic carbonyl hydrolases with increased or diminished Kcat/Km 
5 ratios are described in the examples- Generally, the objective will 
be to secure a mutant having a greater (numerically larger) Kcat/Km 
ratio for a given substrate, thereby enabling the use of the enzyme 
to more efficiently act on a target substrate. An increase in 
Kcat/Km ratio for one substrate may be is accompanied by a reduction 
10 in Kcat/Km ratio for another substrate. This is a shift in 
substrate specificity, and mutants exhibiting such shifts have 
utility where the precursors are undesirable, e.g. to prevent 
undesired hydrolysis of a particular substrate in an admixture of 
substrates. 

15 

Kcat and Km are measured in accord with known procedures, or as 
described in Example 18. 

Oxidation stability is a further objective which is accomplished 
20 by mutants described in the examples. The stability may be enhanced 
or diminished as is desired for various uses. Enhanced stability is 
effected by deleting one or more methionine, tryptophan, cysteine or 
lysine residues and, optionally, substituting another amino acid 
residue not one of methionine, tryptophan, cysteine or lysine. The 
25 opposite substitutions result in diminished oxidation stability. 
The substituted residue is preferably alanyl, but neutral residues 
also are suitable. 

Mutants are provided which exhibit modified pH-activity 
30 profiles. A pH-activity profile is a plot of pH against enzyme 
activity and may be constructed as illustrated in Example 19 or by 
methods known in the art. It may be desired to obtain mutants with 
broader profiles, i.e., those having greater activity at certain pH 
than the precursor, but no significantly greater activity at any pH, 
35 or mutants with sharper profiles, i.e. those having enhanced 
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activity when compared to the precursor at a given pH, and lesser 
activity elsewhere. 

The foregoing mutants preferably are made within the active site 
5 of the enzyme as these mutations are most likely to influence 
activity. However, mutants at other sites important for enzyme 
stability or conformation are useful. In the case of Bacillus 
subtil i sin or its pre, prepro and pro forms, mutations at tyrosine-1, 
aspartate+32, asparagine+155 f tyrosine+104, methionine+222, 

10 glycine+166, histidine+64, glycine+169, phenylalanine+189, serine+33, 
serine+221, tyrosine+217, glutamate+156 and/or alanine+152 produce 
mutants having changes in the characteristics described above or in 
the processing of the enzyme. Note that these amino acid position 
numbers are those assigned to B. amyloliguefaciens subtil isin as 

15 seen from Fig. 7. It should be understood that a deletion or 
insertion in the N-terminal direction from a given position will 
shift the relative amino acid positions so that a residue will not 
occupy its original or wild type numerical position. Also, allelic 
differences and the variation among various procaryotic species will 

20 result in positions shifts, so that position 169 in such subtilisins 
will not be occupied by glycine. In such cases the new positions 
for glycine will be considered equivalent to and embraced within the 
designation glycine+169. The new position for glycine+169 is 
readily identified by scanning the subtil isin in question for a 

25 region homologous to glycine+169 in Fig. 7. 

One or more, ordinarily up to about 10, amino acid residues may 
be mutated. However, there is no limit to the number of mutations 
that are to be made aside from conmercial practicality. 

30 

The enzymes herein may be obtained as salts. It is clear that 
the ionization state of a protein will be dependent on the pH of the 
surrounding medium, if it is in solution, or of the solution from 

which it is prepared, if it is in solid form. Acidic proteins are 

35 

commonly prepared as, for example, the ammonium, sodium, or 
0992Y 



n 



r > a r o 
. J J J 



-20- 



0130756 



potassium salts; basic proteins as the chlorides, sulfates, or 
phosphates. Accordingly, the present application includes both 
electrically neutral and salt forms of the designated carbonyl 
hydrolases, and the term carbonyl hydrolase refers to the organic 
5 structural backbone regardless of ionization state. 

The mutants are particularly useful in the food processing and 
cleaning arts. The carbonyl hydrolases, including mutants, are 
produced by fermentation as described herein and recovered by 

10 suitable techniques. See for example K. Anstrup, 1974, Industrial 
Aspects of Biochemistry , ed. B. Spencer pp. 23-46. They are 
formulated with detergents or other surfactants in accord with 
methods known per se for use in industrial processes, especially 
laundry. In the latter case the enzymes are combined with 

15 detergents, buil**^, bleach and/or fluorescent whitening agents as 
is known in the art for proteolytic enzymes. Suitable detergents 
include linear al<yl benzene sulfonates, alkyl ethoxylated sulfate, 
sulfated linear alcohol or ethoxylated linear alcohol. The 
compositions may ^e formulated In granular or liquid form. See for 
20 example U.S Patents 3,623,957; 4,404,128; 4,381,247; 4,404,115; 
4,318,818; 4,261,868; 4,242,219; 4,142,999; 4,111,855; 4,011,169; 
4,090,973; 3,985,686; 3,790,482; 3,749,671; 3,560,392; 3,558,498; 
and 3,557,002. 

25 The following disclosure is intended to serve as a 

representation of embodiments herein, and should not be construed as 
limiting the scope of this application. 

30 Glossary of Experimental Manipulations 

In order to simplify the Examples certain frequently occurring 
methods will be referenced by shorthand phrases. 



-21- 



0130756 



Plasmids are designated by a small p preceeded and/or followed 
by capital letters and/or numbers. The starting plasmids herein are 
commercially available, are available on an unrestricted basis, or 
can be constructed from such available plasmids in accord with 
5 published procedures. 

"Klenow treatment" refers to the process of filling a recessed 
3' end of double stranded DNA with deoxyribonucleotides 
complementary to the nucleotides making up the protruding 5' end of 

10 the DNA strand. This process is usually used to fill in a recessed 
end resulting from a restriction enzyme cleavage of DNA. This 
creates a blunt or flush end, as may be required for further 
ligations. Treatment with Klenow is accomplished by reacting 
(generally for 15 minutes at 15*C) the appropriate complementary 

15 deoxyribonucleotides with the DMA to be filled in under the 

catalytic activity (usually 10 units) of the Klenow fragment of 
E. coli DNA polymerase I ("Klenow" ) . Klenow and the other reagents 
needed are commercially available. The procedure has been published 
extensively. See for example T. Maniatis et aK, 1982, Molecular 

20 Cloning , pp. 107-108. 

"Digestion" of DNA refers to catalytic cleavage of the DNA with 
an enzyme that acts only at certain locations in the DNA. Such 
enzymes are called restriction enzymes, and the sites for which each 

25 is specific is called a restriction site. "Partial" digestion 
refers to incomplete digestion by a restriction enzyme, i.e., 
conditions are chosen that result in cleavage of some but not all of 
the sites for a given restriction endonuelease in a DNA substrate. 
The various restriction enzymes used herein are commercially 

30 available and their reaction conditions, cof actors and other 
requirements as established by the enzyme suppliers were used. 
Restriction enzymes commonly are designated by abbreviations 
composed of a capital letter followed by other letters and then, 
generally, a number representing the microorganism from which each 

35 restriction enzyme originally was obtained. In general, about 1 ug 
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of plasmid or DNA fragment is used with about 1 unit of enzyme in 
about 20 pi of buffer solution. Appropriate buffers and substrate 
amounts for particular restriction enzymes are specified by the 
manufacturer. Incubation times of about 1 hour at 37°C are 
5 ordinarily used, but may vary in accordance with the supplier's 
instructions. After incubation, protein is removed by extraction 
with phenol and chloroform, and the digested nucleic acid is 
recovered from the aqueous fraction by precipitation with ethanol. 
Digestion with a restriction enzyme infrequently is followed with 

10 bacterial alkaline phosphatase hydrolysis of the terminal 5 1 

phosphates to prevent the two restriction cleaved ends of a DNA 
fragment from "circularizing" or forming a closed loop that would 
impede insertion of another DNA fragment at the restriction site. 
Unless otherwise stated, digestion of plasmids is not followed by 5' 

15 terminal dephosphorylation. Procedures and reagents for 

dephosphorylation are conventional (T. Maniatis et al_. , Id., 
pp. 133-134). 

"Recovery" or "isolation" of a given fragment of DNA from a 
20 restriction digest means separation of the digest on 6 percent 

polyacryl amide gel electrophoresis, identification of the fragment 
of interest by molecular weight {using DMA fragments of known 
molecular weight as markers), removal of the gel section containing 
the desired fragment, and separation of the gel from DMA. This 
25 procedure is known generally. For example, see R. Lawn et al * » 
1981, "Nucleic Acids Res." 9:6103-6114, and D. Goeddel et aK, 
(1980) "Nucleic Acids Res." 8:4057. 

"Southern Analysis" is a method by which the presence of DNA 
30 sequences in a digest or DNA-containing composition is confirmed by 
hybridization to a known, labelled oligonucleotide or DNA fragment. 
For the purposes herein, Southern analysis shall mean separation of 
digests on 1 percent agarose and depurination as described by 
G. Wahl et aK, 1979, "Proc. Nat. Acad. Sci. U.S.A." 76:3683-3687, 
35 transfer to nitrocellulose by the method of E. Southern, 1975, 
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"J. Nol. Biol." 98:503-517, and hybridization as described by 
T. Maniatis et al-, 1978, M Cell n 15:687-701. 

" Transformation" means introducing DNA into an organism so that 
5 the DNA is replicable, either as an extrachromosomal element or 
chromosomal integrant; Unless otherwise stated, the method used 
herein for transformation of E. coli is the CaCl^ method of Mandel 
et aH., 1970, "J, Mol . Biol." 53:154, and for Bacillus, the method 
of Anagnostopolous et al_. , 1961, "J. Bact." 81^:791-746. 

10 

"Ligation" refers to the process of forming phosphodiester bonds 
between two double stranded nucleic acid fragments (T. Maniatis 
et aH., Id., p. 146). Unless otherwise stated, ligation was 
accomplished using known buffers and conditions with 10 units of T4 
15 DMA ligase ("ligase") per 0.5 ug of approximately equimolar amounts 
of the DNA fragments to be li gated. Plasmids from the transformants 
were prepared, analyzed by restriction mapping and/or sequenced by 
the method of Messing, et al/, 1981, "Nucleic Acids Res. w , 9:309. 

20 "Preparation" of DNA from transformants means isolating plasmid 

DNA from microbial culture. Unless otherwise stated, the 
alkaline/SDS method of Maniatis et aU, Id. p. 90., was used. 

"Oligonucleotides" are short length single or double stranded 
25 polydeoxynucleotides Which were chemically synthesized by the method 
of Crea et al_., 1980, "Nucleic Acids Res." 8:2331-2348 (except that 
mesitylene nitrotriazole was used as a condensing agent) and then 
purified on polyacryl amide gels. 

30 All literature citations are expressly incorporated by reference. 
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Example 1 

Preparation of a Genomic ONA Library from B. amylol iquifaciens 
and Isolation of its Subtilisin Gene 

5 

The known amino acid sequence of the extracellular 
fJ. amyloliquefaciens permits the construction of a suitable probe 
mixture. The sequence of the mature subtilisin is included (along 
with the additional information contributed by the present work) in 
iq Figure 1. All codon ambiguity for the sequence of amino acids at 
position. 117 through 121 is covered by a pool of eight 

oligonucleotides of the sequence AA(y)AA(y)ATGGA(y)GT. 

Chromosomal DNA isolated from amylol iguefaciens (ATCC Mo. 

23844) as described by J - Marmur, "J. Mol , Biol.", 3:208, was 
15 " 

partially digested by Sau 3A, and the fragments size selected and 

ligated into the BamH 1 site of dephosphorylated pBS42. (pBS42 is 

shuttle vector containing origins of replication effective both in 

£. coli and Bacillus. It is prepared as described in Example 4.) 

The Sau3A fragment containing vectors were transformed into E. col i 
20 ~" 
K12 strain 294 {ATCC No. 31446) according to the method of M. 

Mandel, et aK, 1970, M J. Mol . Bio." 53: 154 using 80-400 nanograms 

of library DNA per 250pL of competent cells. 



Cells from the transformation mixture were plated at a 
density of 1-5 x 10 3 transformants per 150mm plate containing LB 
medium + 12.5 jig/ml chloramphenicol, and grown overnight at 37*C 
until visible colonies appeared. The plates were then replica 
plated onto BA85 nitrocellulose filters overlayed on LB/chloram- 
phenicol plates. The!' replica plates were grown 10-12 hours at 37°C 
and the filters transferred to fresh plates containing LB and 
150 yg/ml spectinomycin to amplify the plasmid pool. 



35 



After overnight incubation at 37*C, filters were processed 

essentially as described by Grunstein and Hogness, 1975, "Proc. 
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Natl. Acad. Sci. (USA)" 72: 3961. Out of approximately 20,000 
successful transformants, 25 positive colonies were found. Eight of 
these positives were streaked to purify individual clones. 24 
clones from each streak were grown in microti ter wellc- stamped on 
5 to two replica filters, and probed as described above with either 

AACAA(£)ATGGA(£)GT(pool 1) or AATAA{y)ATGGA(y )GT(pool 2) which differ 

by only one nucleotide. As shown in Figure 2, pool 1 hybridized to a 
much greater extent to all positive clones than did pool 2, suggesting 
1Q specific hybridization. 

Four out of five miniplasmid preparations (Maniatis et al . , 
Id.) from positive clones gave identical restriction digest patterns 
when digested with Sau3A or Hindi. The plasmid isolated from one of 
15 these four identical colonies by the method of Maniatis et aK, Id,, 
had the entire correct gene sequence and was designated pS4. The 
characteristics of this plasmid as determined by restriction analysis 
are shown in Figure 3. 

20 

Example 2 
Expression of the Subtilisin Gene 



25 Bacillus subtlHs 1-168 (Catalog No. 1-A1, Bacillus Genetic 

Stock Center) was transformed with pS4 and and a single chloramphenicol 
resistant transformant then grown 1n minimal medium. After 24 hours, 
the culture was centrifuged and both the supernatant (10-200 mD and 
pellet assayed for proteolytic activity by measuring the change in 

3Q absorbance per minute at 412 nm using 1 ml of the chromogenic substrate 
succinyl-L-ala-ala-pro-phe-p-nitroanil1de (0.2 uM) in 0.1M sodium 
phosphate (pH 8.0) at 25*C. A B. subtil 1s 1-168 culture transformed 
with pBS42 used as a control showed less than 1/200 of the activity 
shown by the pS4 transformed culture. Greater than 95 percent of the 

35 protease activity of the pS4 culture was present in the supernatant, 
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and was completely inhibited by treatment with phenyl methyl sulfonyl 
fluoride (PMSF) but not by EDTA. 

Aliquots of the supernatants were treated~with PMSF and EDTA to 
5 inhibit all protease activity and analyzed by 12 percent SDS-PAGE 

according to the method of Laemmli, U.K., 1970 "Nature", 2£7: 680. To 
prepare the supernatants, 16 pL of supernatant was treated with ImM 
PMSF, 10 m.M EDTA for 10 minutes, and boiled with 4 M L of 5x 
concentrated SDS sample buffer minus B-mercaptoethanol . The results 

10 of Coomassie stain on runs using supernatants of cells transformed 
with pS4*, pBS42, and un trans formed amyloliquefaciens are shown in 
Figure 4. Lane 3 shows authentic subtilisin f rom B_. amyloliquefaciens 
Lane 2 which is the supernatant from pBS42 transformed subtil is , 
does not give the 31,000 MW band associated with subtilisin which is 

15 exhibited by Lane 1 from pS4 transformed hosts. The approximately 
31,000 MW band result for subtilisin is characteristic of the slower 
mobility shown by the known M.W. 27,500 subtilisin preparations in 
general . 

20 

Example 3 

Sequencing of the B. amyloliquefaciens Subtilisin Gene 

25 The entire sequence of an EcoRI-BamHI fragment (wherein the EcoRI 
site was constructed by conversion of the Hindi site) of pS4 was 
determined by the method of F. Sanger, 1977, "Proc. Natl. Acad. Sci 
(USA)", 74:5463. Referring to the restriction map shown in Figure 3, 
the BamHI-PvuII fragment was found to hybridize with pool 1 

30 oligonucleotides by Southern analysis. Data obtained from sequencing 
of this fragment directed the sequencing of the remaining fragments 
(e.g. PvuII-HincII and Aval-Aval). The results are shown in Figure 1. 

Examination of the sequence confirms the presence of codons for 
35 the mature subtilisin corresponding to that secreted by the 
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1$. any 1 ol i quef ac i ens . Immediately upstream from this sequence is a 
series of 107 codons beginning with the GTG start codon at -107 • 
Codon -107 to approximately codon -75 encodes an amino acid sequence 
whose characteristics correspond to that of known signal sequences. 
5 (Most such signal sequences are 18-30 amino acids in length, have 
hydrophobic cores, and terminate in a small hydrophobic amino acid.) 
Accordingly, examination of the sequence data would indicate that 
codons -107 to approximately -75 encode the signal sequence; the 
remaining intervening codons between -75 and -1 presumably encode a 
10 prosequence. 

Example 4 

15 Construction of pBS42 

pBS42 is formed by three-way ligation of fragments derived from 
pUBHO. pC194, and pBR322 (see Figure 5). The fragment from pUBHO is 
the approximately 2600 base pair fragment between the Hpall site at 

20 1900 and the BamHl site at 4500 and contains an origin of replication 
operable in Bacillus: T. Grycztan, et aK, 1978 "J. Bacteriol 134 : 
318 (1978); A. Jalanko, et aK, 1981 "Gene". 14: 325. The BamHI site 
was tested with Klenow. The pBR322 portion is the 1100 base pair 
fragment between the PvuII site at 2067 and the Sau3A site at 3223 

25 which contains the E. coli origin of replication: F. Bolivar, et al 
1977 "Gene", 2: 95; J. Sutcliffe, 1978, Cold Spring Harbor Symposium 
43: I, 77. The pC194 fragment is the 1200 base pair fragment between 
the Hpall site at 973 and the Sau3A site at 2006 which contains the 
gene for chloramphenicol resistance expressible in both E_. col i and 

3b subtilis : S. Ehrlich, "Proc. Natl. Acad. Sci. (USA)", 74:1680; 
S. Horynuchi et aK, 1982, tt J. Bacteriol." 150: 815. 

pBS42 thus contains origins of replication operable both in Z. 
col 1 and in Bacillus and an expressible gene for chloramphenicol 
35 resistance. 
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Example 5 

Isolation and Sequencing of the B. subtil is Subtil isin Gene 

5 B. subtil is 1168 chromosomal DNA was digested with EcoRI and the 

fragments resolved on gel electrophoresis". A single 6 kb fragment 
hybridized to a [a- 32 P] CTP nick translation - labelled fragment 
obtained from the C-terminus of the subtil isin structural gene in pS4, 
described above. The 6 kb fragment was electroluted and ligated into 
10 pBS42 which had been digested with EcoRI and treated with bacterial 
alkaline- phosphatase. E. coli ATCC 31446 was transformed with the 
ligation mixture and transformants selected by growth on LB agar 
containing 12.5 M g chloramphenicol /ml . Plasmid DNA was prepared from 
a pooled suspension of 5,000 transformed colonies. This DNA was 
15 transformed into EL subtil is BG84. a protease deficient strain, the 
preparation of which is described in Example 8 below. ^Colonies which 
produced protease were screened by plating on LB agar plus 1.5 percent 
w/w Carnation powdered nonfat skim milk and 5 ug chloramphenicol /ml 
(hereafter termed skim milk selection plates) and observing for zones 
20 of clearance evidencing proteolytic activity. 

Plasmid DNA was prepared from protease producing colonies, 

digested with EcoRI, and examined by Southern analysis for the 

presence of the 6 kb EcoRI insert by hybridization to the 
32 

25 P-labelled C-terminus fragment of the subtillsin structural gene 
from B. amyloliquefadens . A positive clone was identified and the 
plasmid was designated pS168.1. B. subtil is BG84 transformed with 
pS168.1 excreted serine protease at a level 5-fold over that produced 
in B. subtil is 1168. Addition of EDTA to the supernatants did not 

30 affect the assay results, but the addition of PMSF 

(phenyl methyl sufonyl fluoride) to the supernatants reduced protease 
activity to levels undetectable In the assay described in Example 8 
for strain BG84. 
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A restriction map of the 6.5 kb EcoRI insert is shown in Fig. 6. 
The subtilisin gene was localized to within the 2.5 kb KpnI-EcoRI 
fragment by subcloning various restriction enzyme digests and testing 
for expression cf subtilisin in subtil is BG84. Southern analysis 
5 with the labelled fragment from the C-terminus of the 

B. amylol iquefaciens subtilisin gene as a probe localized the 
C-terminus of the subtil is gene to within or part of the 631 bp 
Hindi fragment B in the center of this subclone (see Fig. 6). The 
tandem Hindi fragments B f C, and D and HincII-EcoRI fragment E 

10 (Fig. 6) were ligated into the M13 vectors mp8 or mp9 and sequenced in 
known fashion (J. Messing et al_., 1982, "Gene" 19:209-276) using 
dideoxy chain termination (F. Sanger et al . , 1977, "Proc. Mat. Acad. 
Sci. U.S. A." 74:5463-5467). The sequence of this region is shown in 
Fig. 7. The first 23 amino acids are believed to be a signal 

15 peptide. The remaining 83 amino acids between the signal sequence and 
the mature coding sequence constitute the putative "pro M sequence. 
The overlined nucleotides at the 3' end of the gene are believed to be 
transcription terminator regions. Two possible Shine-Dalgarno 
sequences are underlined upstream from the mature start codon. 

20 

Example 6 

Manufacture of an Inactivating Mutation of the B. subtil is 
25 Subtilisin Gene 

A two step ligation, shown in Fig. 8, was required to construct a 
plasmid carrying a defective gene which would integrate into the 
Bacillus chromosome. In the first step, pS168.1, which contained the 

30 6.5 kb insert originally recovered from the subtil is genomic 

library as described in Example 5 above, was digested with EcoRI, the 
reaction products treated with Klenow, the ONA digested with Hindi, 
and the 800 bp EcoRI-HincII fragment E (see Fig. 6) that contains, in 
part, the 5' end of the 8. subtil is subtilisin gene, was recovered. 

35 This fragment was ligated into pJHIOl (pJHIOl is available from 
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J. Hoch (Scripps) and is described by F.A. Ferrari et aK , 1983, 
" "J. Bact." 134:318-329) that had been digested with Hindi and treated 
with bacterial alkaline phosphotase. The resultant plasmid, pIDYl, 
contained fragment E in the orientation shown in Fig. 8. In the 

5 second step, pS168.1 was digested with Hindi and the 700 bp Hindi 
fragment B, which contains the 3' end of the subtil isin gene, was 
recovered. pIDVl was digested at its unique Hindi site and 
fragment B ligated to the linearized plasmid, transformed in Z* coli 
ATCC 31,446, and selected on LB plates containing 12.5 ug 

10 chloramphenicol /ml or 20 ug arnpicill in/ml . One resulting plasmid, 
designated pI0V1.4, contained fragment B in the correct orientation 
with respect to fragment E. This plasmid pIDV1.4, shown in Fig. 8, is 
a deletion derivative of the subtil isin gene containing portions of 
the 5' and 3' flanking sequences as well. 

15 

£• subtil is BG77, a partial protease-def icient mutant {Prt + ^") 
prepared in Example 8 below was transformed with pIDV1.4. Two classes 
of chloramphenicol resistant (Cm r ) transformants were obtained. 
Seventy-five percent showed the same level of proteases as BG77 

20 (Prt + '") and 25 percent were almost completely protease deficient 

(Prt") as observed by relative zones of clearing on plates containing 
LB agar plus skim milk. The Cm r Prt" trans forma *,s could not be 
due to a single crossover integration of the plasmid at the homologous 
regions for fragment E or B because, in such a case, the gene would be 

25 uninterrupted and the phenotype would be Prt + ^~. In fact, when 

either of fragments E or B were ligated independently into pJHIOl and 
subsequently transformed into B. subtil is BG77, the protease deficient 
phenotype was not observed. The Cm r phenotype of Cm r Prt" 
pIDV1.4 transformants was unstable in that Cm s Prt" derivatives 

30 could be isolated from Cm r Prt" cultures at a frequency of about 
0.1 percent after 10 generations of growth in minimal medium in the 
absence of antibiotic selection. One such derivative was obtained and 
designated BG2018. The deletion was transferred into IA84 (a BGSC 
strain carrying two auxotrophic mutations flanking the subtil isin gene) 

35 by PBS1 transduction. The derivative organism was designated BG2019. 
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Example 7 

Preparation of a Genomic DHA Library from B. subtil is and 
Isolation of its Neutral Protease Gene 

5 

The partial amino acid sequence of a neutral protease of 
B. subtil is is disclosed by P. Levy et al^ 1975, "Proc. Nat. Acad. 
Sci. USA" 72:4341-4345. A region of the enzyme (Asp Gin Met He Tyr 
Gly) was selected from this published sequence in which the least 
10 redundancy existed in the potential codons for the amino acids in the 
region. * 24 combinations were necessary to cover all the potential 
coding sequences, as described below. 

GA : CA ~ ATG AT ' TA C GG 
15 C A £ 

Asp Gin Met lie Tyr Gly 

Four pools, each containing six alternatives, were prepared as 
20 described above in Example 1. The pools were labelled by 
phosphorylation with [^- 32 p] ATP. 

The labelled pool containing sequences conforming closest to a 
unique sequence in a B. subtil is genome was selected by digesting 

25 B. subtil is (1A72, Bacillus Genetic Stock Center) DNA with various 
restriction enzymes, separating the digests on an electrophoresis 
gel, and hybridizing each of the four probe pools to each of the 
blotted digests under increasingly stringent conditions until a 
single band was seen to hybridize. Increasingly stringent 

30 conditions are those which tend to disfavor hybridization, e.g., 
increases in formamide concentration, decreases in salt 
concentration and increases in temperature. At 37°C in a solution 
of 5x tenhardfs, 5x SSC, 50 mM NaP0 4 P H 6.8 and 20 percent 
formamide, only pool 4 would hybridize to a blotted digest. These 

35 were selected as the proper hybridization conditions to be used for 
the neutral protease gene and pool 4 was used as the probe. 
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A lambda library of B. subtil is strain BGSC 1-A72 was prepared 
In conventional fashion by partial digestion of the Bacillus genomic 
DNA by Sau3A, separation of the partial digest by molecular weight 
on an electrophoresis gel, elution of 15-20 kb fragments (R. Lav/n 
5 et aK, 1981, "Nucleic Acids Res." 9:6103-6114), and ligation of the 
fragments to BamHI digested charon 30 phage using a Packagene kit 
from Promega Biotec. 

E. coli DPSOsupF was used as the host for the phage library, 
10 although any known host for Charon lambda phage is satisfactory. 
The E. coli host was plated with the library phage and cultured, 
after which plaques were assayed for the presence of the neutral 
protease gene by transfer to nitrocellulose and screening with probe 
pool 4 (Benton and Davis, 1977, -Science" 196:180-182). Positive 
15 plaques were purified through two rounds of single plaque 
purification, and two plaques were chosen for further study, 
designated xNPRGl and xNPRG2. DNA was prepared from each phage by 
restriction enzyme hydrolysis and separation on electrophoresis 
gels. The separated fragments were blotted and hybridized to 
20 labelled pool 4 oligonucleotides. This disclosed that xNPRGl 

contained a 2400 bp Hindi 1 1 hybridizing fragment, but no 4300 EcoRI 
fragment, while XNPRG2 contained a 4300 bp EcoRI fragment, but no 
2400 bp Hindi 1 1 fragment. 

25 The 2400 bp xNPRGl fragment was subcloned into the Hindi 1 1 site 

of pJHIOl by the following method. xNPRGl was digested by Hindlll, 
the digest fractionated by electrophoresis and the 2400 bp fragment 
recovered from the gel. The fragment was ligated to alkaline 
phosphatase-treated Hindlll digested pJHIOl and the ligation mixture 

30 used to transform E. coli ATCC 31446 by the calcium chloride shock 
method of V- Hershf ield et al_. , 1974, "Proc. Nat. Acad. Sci. 
(U.S.A. ) u 79:3455-3459). Transfonnants were identified by selecting 
colonies capable of growth on plates containing LB medium plus 
12.5 yg chloramphenicol /ml . 
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Transformant colonies yielded several plasmids. The orientation 
of the 2400 bp fragment in each plasmid was determined by 
conventional restriction analysis (orientation is the sense reading 
or transcriptional direction of the gene fragment in relation to the 
5 reading direction of the expression vector into which it is 

li gated.) Two plasmids with opposite orientations were obtained and 
designated pNPRsubH6 and pNPRsubHl. 

The 4300 bp EcoRI fragment of xNPRG2 was subcloned into pBR325 
10 by the method described above for the 2400 bp fragment except that 
xNPRG2 was digested with EcoRI and the plasmid was alkaline 
phosphatase-treated, EcoRI-digested pBR325. pBR325 is described by 
F. Bolivar, 1978, "Gene" 4:121-136. Two plasmids were identified in 
which the 4300 bp insert was present in different orientations. 
15 These two plasmids were designated pNPRsubRI and pNPRsubRIb. 

Example 8 

20 Characterization of B. subtil is Neutral Protease Gene 

The pNPRsubHl insert was sequentially digested with different 
restriction endonucleases and blot hybridized with labelled pool 4 
in order to prepare a restriction map of the insert (for general 

25 procedures of restriction mapping see T. Naniatis et aK, Id., 
p. 377). A 430 bp Rsal fragment was the smallest fragment that 
hybridized to probe pool 4. The Rsal fragment was li gated into the 
Smal site of M13 mp8 (J. Messing et al_., 1982, "Gene" 19:269-276 and 
o. Messing in Methods in Enzymology , 1983, R. Wu et aU , Eds., 

30 101 :20-78) and the sequence determined by the chain-terminating 
dideoxy method {F. Sanger et aK , 1977, "Proc. Nat. Acad. Sci. 
U.S.A." 74:5463-5467). Other restriction fragments from the 
pNPRsubHl insert were 1 {gated into appropriate sites in M13 mp8 or 
M13 mp9 vectors and the sequences determined. As required, dITP was 

35 used to reduce compression artifacts (D. Mills et aj_. » 1979, "Proc. 
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Nat. Acad. Sci. (U.S.A. )" 76:2232-2235). The restriction map for 
the pNPRsubHl fragment is shown in Fig, 9. The sequences of the 
various fragments from restriction enzyme digests were compared and 
an open reading frame spanning a codon sequence translatable into 
the amino and carboxyl termini of the protease (P. Levy et aK > Id.) 
was determined. An open reading frame is a DNA sequence commencing 
at a known point which in reacing frame (every three nucleotides) 
does not contain any internal tcnnination codons. The open reading 
frame extended past the amino terminus to the end of the 2400 bp 
Hindi 1 1 fragment. The 1300 bp Bglll - Hindi 1 1 fragment was prepared 
from pNPHsubRIb (which contained the 4300 bp EcoRI fragment of 
XNPRG2) and cloned in M13 mp8. The sequence of this fragment, which 
contained the portion of the neutral protease leader region not 
encoded by the 2400 bp fragment of pNPRsubHl, was determined for 400 
nucleotides upstream from the Hindi 1 1 site. 

The entire nucleotide sequence as determined for this neutral 
protease gene, including the putative secretory leader and prepro 
sequence, are shown in Fig. 10. The numbers above the line refer to 
amino acid positions. The underlined nucleotides in Fig. 10 are 
believed to constitute the ribosorae binding (Shine-Dalgarno) site, 
while the overlined nucleotides constitute a potential hairpin 
structure presumed to be a terminator. The first 27 - 28 of the 
deduced amino acids are believed to be the signal for the neutral 
protease, with a cleavage point at ala-27 or ala-28. The "pro" 
sequence of a proenzyme structure extends to the ami no-terminal 
amino acid (ala-222) of the mature, active enzyme. 

A high copy plasmid carrying the entire neutral protease gene 
was constructed by (Fig. 11) ligating the Bglll fragment of 
pNPRsubRl, which contains 1900 bp (Fig. 9), with the PvuII - Hindlll 
fragment of pNPRsubHl, which contains 1400 bp . pBS42 (from 
Example 4) was digested with BamHI and treated with bacterial 
alkaline phosphatase to prevent plasmid recircularization. 
pNPRsubRl was digested with Bglll, the 1900 bp fragment was isolated 
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from gel electrophoresis and ligated to the open BamHI sites of 
PBS42. The ligated plasmid was used to transform E. coli ATCC 31446 
by the calcium chloride shock method (V. Nershfield et al_. , Id.), 
and transformed cells selected by growth on plates containing LB 
5 medium with 12.5 M g/ml chloramphenicol. A plasmid having the Bgl II 
fragment in the orientation shown in Fig. 11 was isolated from the 
transformants and designated pNPRsubBl. pNPRsubBl was digested 
(linearized) with EcoRI, repaired to flush ends by Klenow treatment 
and then digested with Hindlll. The larger fragment from the 
10 Hindi I I digestion (containing the sequence coding for the amino 
terminal' and upstream regions) was recovered. 

The carboxyl terminal region of the gene was supplied by a 
fragment from pNPRsubHl, obtained by digestion of pNPRsubHl with 

15 PvuII and Hindi 1 1 and recovery of the 1400 bp fragment. The flush 
end PvuII and the Hindi 1 1 site of the 1400 bp fragment was ligated, 
respectively, to the blunted EcoRI and the Hindlll site of 
pNPRsubBl, as shown in Fig. 11. This construct was used to 
transform B_. subtil is strain BG84 which otherwise excreted no 

20 proteolytic activity by the assays described below. Transformants 
were selected on plates containing LB medium plus 1.5 percent 
carnation powdered nonfat milk and 5ug/ml chloramphenicol. Plasmids 
from colonies that cleared a large halo were analyzed. Plasmid 
pNPRlO, incorporating the structural gene and flanking regions of 

25 the neutral protease gene, was determined by restriction analysis to 
have the structure shown in Fig. 11. 

B. subtil is strain BG84 was produced by N-methyl-N'-nitro-N- 

nitrosoguanidine (NTG) mutagenesis of B. subtil is 1168 according to 

30 the general technique of Adelberg et aK. 1965. "Biochem. Biophys. 

Res. Commun." 18:788-795. Mutagenized strain 1168 was plated on 

skim milk plates (without antibiotic). Colonies producing a smaller 

halo were picked for further analysis. Each colony was 

characterized for protease production on skim milk plates and 

35 amylase production on starch plates. One such isolate, which was 
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partially protease deficient, amylase positive and capable of 
sporulation, was designated BG77. The protease deficiency mutation 
was designated prt-77. The prt-77 allele was moved to a spoOA 
background by congression as described below to produce strain BG84, 
5 a sporulation deficient strain. 



Table A 



10 Strain 



Relevant Genotype 



origin 



15 



1168 trpC2 

JH703 trpC2 , pheA12 , spoOAa677 

BG16 £urB6, metBS, 1euA8 t 1ys-21 . hisA , thr-5 

sacA321 

BG77 trpC2 , prt-77 

BG81 metBS , prt-77 

20 BG84 spoOA6>7 , prt-77 



Trousdale et al .a 
Pb 1665 

NTG x 1168 
BG16 DNA x BG77 
JH703 DNA x BG81 



a "Mol. Gen. Genetics" 173:61 (1979) 



25 BG84 was completely devoid of protease activity on skim milk 

plates and does not produce detectable levels of either subtilisin 
or neutral protease when assayed by measuring the change in 
absorbance at 412 mn per minute upon incubation with 0.2 yg/ml 
succinyl (-L-ala-L-ala-t-pro-L-phe) p-nitroanilide (Vega) in 0.1 M 

30 sodium phosphate, pH 8, at 25*C. BG84 was deposited in the ATCC as 
deposit number 39382 on July 21, 1983. Samples for subtilisin assay 
were taken from late logarithmic growth phase supernatants of 
cultures grown in modified Schaeffer's medium (T. Leighton et aK, 
1971, "J. Biol. Chem." 246:3189-3195). 

35 
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Example 9 

Expression of the Neutral Protease Gene . 

BG84 transformed with pMPRlO was inoculated into minimal media 
supplemented with 0.1 percent casein hydrolysate and 10 pg 
chloramphenicol and cultured for 16 hours. 0.1 ml of culture 
supernatant was removed and added to a suspension of 1.4 mg/ml 
Azocoll proteolytic substrate (Sigma) in 10 mM Tris-HCl, 100 mM MaCl 
pH 6.8 and incubating with agitation. Undigested substrate was 
removed -by centrifugation and the optical density read at 505 nm. 
Background values of an Azocoll substrate suspension were 
subtracted. The amount of protease excreted by a standard 
protease-expressing strain, BG16 was used to establish an arbitrary 
level of 100. The results with BG16, and with BG84 transformed with 
control and neutral protease gene-containing plasmids are shown in 
Table B in Example 12 below. Transformation of the excreted 
protease-devoid subtil is strain BG84 resul ts in excretion of 
protease activity at considerably greater levels than in BG16, the 
wild- type strain. 

Example 10 

Manufacture of an Inactivating Mutation of the Neutral Protease 
Gene 

The two Rsal bounded regions in the 2400 bp insert of pNPRsubHl, 
totalling 527 bp, can be deleted in order to produce an incomplete 
structural gene. The translational products of this gene are 
en^ymatically inactive- A plasmid having this deletion was 
constructed as follows- pJHIOl was cleaved by digestion with 
Hindi II and treated with bacterial alkaline phosphatase. The 
fragments of the neutral protease gene to be incorporated into 
linearized pJHIOl were obtained by digesting pNPRsubHl with Hindi II 
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and Rsal, and recovering the 1200 bp Hindlll-Rsal and 680 bp 
Rsal-Hindlll fragments by gel electrophoresis. These fragments were 
ligated into linearized pJHIOl and used to transform E. coli 
ATCC 31446. Transformants were selected on plates containing LB 
5 medium and 20 M g ampicill in/ml . Plasmids were recovered from the 
transformants and assayed by restriction enzyme analysis to identify 
a plasmid having the two fragments in the same orientation as in the 
pNPRsubHl starting plasmid. The plasmid lacking the internal Rsal 
fragments was designated pNPRsubHU. 

10 

Example 11 

Replacement of the Neutral Protease Gene with a Deletion Mutant 

15 

Plasmid pNPRsubhla was transformed into 0. subtil is strain 
BG2019 (the subtil i sin deleted mutant from Example 6) and 
chromosomal integrants were selected on skim milk plates. Two types 
of Cm r transformants were noted, those with parental levels of 

20 proteolysis surrounding the colony, and those with almost no zone of 
proteolysis. Those lacking a zone of proteolysis were picked, 
restreaked to purify individual colonies, and their protease 
deficient character on skim milk plates confirmed. One of the 
Cm r , proteolysis deficient colonies was chosen for further studies 

25 (designated BG2034). Spontaneous Cm s revertants of BG2034 were 
isolated by overnight growth in LB media containing no Cm, plating 
for individual colonies, and replica plating on media with and 
without Cm. Three Cm s revertants were isolated, two of which were 
protease proficient, one of which was protease deficient (designated 

30 BG2036). Hybridization analysis of BG2036 confirmed that the 

plasmid had been lost from this strain, probably by recombination, 
leaving only the deletion fragments of subtilisin and neutral 
protease. 

35 
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Example 12 

Phenotype of Strains Lacking Functional Subtilisin and Neutral 
Protease 

The growth, sporulation and expression of proteases was examined 
in strains lacking a functional gene for either the neutral or 
alkaline protease or both. The expression of proteases was examined 
by a zone of clearing surrounding a colony on a skim milk plate and 
by measurement of the protease levels in liquid culture supernatants 
(Table B). A strain (BG2035) carrying the subtilisin gene deletion, 
and showed a 30 percent reduction level of protease activity and a 
normal halo on milk plates. Strain BG2043, carrying the deleted 
neutral protease gene and active subtilisin gene, and constructed by 
transforming B616 (Ex. 8) with DNA from BG2036 (Example 11), showed 
an 80 percent reduction in protease activity and only a small halo 
on the milk plate. Strain BG2054, considered equivalent to BG2036 

Table B 

Effect of protease deletions on protease expression and sporulation. 



Genotype* Protease activity 0 Percent Sporulation 



BG16 

BG2035 

BG2043 

BG2054 

BG84(pBS42) 

BG84{pNPR10) 



Wild type 100 

apr&684 70 

nprEA522 20 

apr&684,nprEA522 ND 

spo0AA677,prt-77 ND 

spo0AA677,prt-77 3000 



40 
20 
20 
45 



a 0nly the loci relevant to the protease phenotype are shown. 

D Protease activity is espressed In arbitrary units, BG16 was assigned a 

level of 100. ND indicates the level of protease was not detectable in 

the assay used. 
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(Example 11) in that it carried the foregoing deletions in both 
genes, showed no detectable protease activity in this assay and no 
detectable halo on milk plates. The deletion of either or both of 
the protease genes had no apparent effect on either growth or 
5 sporulation. Strains carrying these deletions had normal growth 
rates on both minimal glucose and LB media. The strains sporulated 
at frequencies comparable to the parent strain BG16. Examination of 
morphology of these strains showed no apparent differences from 
strains without such deletions. 

10 

Example 13 

Site-specific Saturation Mutagenesis of the B. Amyloliquefaciens 
Subtilisin Gene at Position 222; Preparation of the Gene for 
15 Cassette Insertion 

pS4-5, a derivative of pS4 made according to Wells et al . , 
"Nucleic Acids Res.", 1983, U.:7911-7924 was digested with EcoRI and 
BamHI, and the 1.5 kb EcoRI-BamHI fragment recovered. This fragment 

20 was ligated into replicative form M-l' ro9 which had been digested 
with EcoRI and BamHI (Sanger et al_., 1980, M J. Mol . Biol." 143 
161-178. Messing etal, 1981, "Nucleic Acids Research" 9, 304-321. 
Messing, J. and Vieira, J. (1982) Gene 19, 269-276). The M-13 mp9 
phage ligations, designated M-13 mp9 SUBT, were used to transform 

25 coli strain JM101 and single stranded phage DNA was prepared from 
a two mL overnight culture. An oligonucleotide primer was 
synthesized having the sequence 

5' -GTACAACGGTACCTCACGCACGCTGCAGGAGCGGCTGC-3' . This primer conforms 
to the sequence of the subtilis gene fragment encoding amino acids 

30 216-232 except that the 10 bp of codons for amino acids 222-225 were 
deleted, and the codons for amino acids 220, 227 and 228 were 
mutated to introduce a Kpnl site 5* to the met-222 codon and a PstI 
site 3 1 to the roet+222 codon. See Fig. 12. Substituted nucleotides 
are denoted by asterisks, the underlined codons in line 2 represent 

35 the new restriction sites and the scored sequence in line 4 



0992Y 



-41- 



0130756 



represents the inserted oligonucleotides. The primer {about 15 pM) 
Was labelled with [ 32 p] by incubation with [y 32 p]-ATP (10 pL in 
20 pL reaction MAmersham 5000 Ci/mmol, 10218) and T 4 
polynucleotide kinase (10 units) followed by non-radioactive ATP 
5 (100 uM) to allow complete phosphorylation of the mutagenesis 

primer. The kinase was inactivated by heating the phosphorylation 
mixture at 68*0 for 15 min. 

The primer was hybridized to M-13 mp9 SUBT as modified from 

10 Norris et al_., 1983, "Nucleic Acids Res." 11. 5103-5112 by combining 
5 pL of the labelled mutagenesis primer (~3 pM), "1 pg M-13 mp9. SUBT 
template, 1 pL of 1 pM M-13 sequencing primer {17-mer) f and 2.5 pL 
of buffer (0.3 M Tris pH 8, 40 mM MgCl 2 , 12 mM EDTA, 10 mM DTT, 
0.5 mg/ml BSA). The mixture was heated to 68 # C for 10 minutes and 

15 cooled 10 minutes at room temperature. To the annealing mixture was 
added 3.6 M L of 0.25 mM dGTP, dCTP, dATP, and dTTP, 1.25 pL of 10 mM 
ATP, 1 P L ligase (4 units) and 1 uL Klenow (5 units). The primer 
extension and ligation reaction (total volume 25 pi) proceeded 
2 hours at 14°C. The Klenow and ligase were inactivated by heating 

20 to 68°C for 20 min. The heated reaction mixture was digested with 
BamHl and EcoRI and an aliquot of the digest was applied to a 6 
percent polyacryl amide gel and radioactive fragments were visualized 
by autoradiograpt\y. This showed the [ 32 P] mutagenesis primer had 
indeed been incorporated into the EcoRI -BamHl fragment containing 

25 the now mutated subtilisin gene. 

The remainder of the digested reaction mixture was diluted to 
200 U L with 10 mM Tris, pH 8, containing 1 mM EDTA, extracted once 
with a 1:1 (v:v) phenol /chloroform mixture, then once with 

30 chloroform, and the aqueous phase recovered. 15 pL of 5M ammonium 
acetate (pH 8) was added along with two volumes of ethanol to 
precipitate the ONA from the aqueous phase. The DMA was pelleted by 
centrifugation for five minutes in a microfuge and the supernatant 
was discarded. 300 pL of 70 percent ethanol was added to wash the 

35 Df«A pellet, the wash was discarded and the pe* "c* lyopbil ized. 
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pBS42 from example 4 above was digested with BamHl and EcoRI and 
purified on an acrylamide gel to recover the vector. 0.5ug of the 
digested vector, 50uM ATP and 6 units ligase were dissolved in 20 ul 
of ligation buffer. The ligation went overnight at 14°C. The DNA 
5 was transformed into E. col i 294 rec + and the transfortnants grown 
in 4 ml of LB medium containing 12.5 ug/ml chloramphenicol. Plasmid 
DNA was prepared from this culture and digested with Kpnl, EcoRI and 
BamHI. Analysis of the restriction fragments showed 30-50 percent 
of the molecules contained the expected Kpnl site programed by the 

1(J mutagenesis primer. It was hypothesized that the plasmid population 
not including the Kpnl site resulted from M-13 replication before 
bacterial repair of the mutagenesis site, thus producing a 
heterogenous population of Kpnl* and KpnT plasmids in some of 
the transformants. In order to obtain a pure culture of the Kpnl + 

15 plasmid, the DMA was transformed a second time into £• coli to clone 
plasmids containing the new Kpnl site. DNA was prepared from 16 
such transformants and six were found to contain the expected Kpnl 
site. 

20 Preparative amounts of DNA were made from one of these six 

transformants (designated p*222) and restriction analysis confirmed 
the presence and location of the expected Kpnl and PstI sites. 40 
ug of pA222 were digested in 300 nL of Kpnl buffer plus 30 uL Kpnl 
{300 units) for 1.5 h at 37 # C. The DNA was precipitated with 

25 ethanol, washed with 70 percent ethanol, and lyophilized. The DNA 
pellet was taken up in 200 U L Hindi 1 1 buffer and digested with 20 uL 
(500 units) PstI for 1.5 h at 37*C. The aqueous phase was extracted 
with phenol/CHCl 3 and the DNA precipitated with ethanol. The DNA 
was dissolved in water and purified by polyacryl amide gel 

3Q electrophoresis. Following electroelution of the vector band (120 v 
for 2 h at 0*C in 0.1 tiroes TBE (Mantatis et aK, Id.)) the DNA was 
purified by phenol/CHCl 3 extraction, ethanol precipitation and 
ethanol washing. 



0992Y 



-43- 



0130756 



10 



Although p*222 could be digested to completion (>98 percent) by 
either Knpl or PstI separately, exhaustive double digestion was 
incomplete {«50 percent). This may have resulted from the fact 
that these sites were so close (10 bp) that digestion by Knpl 
allowed "breathing" of the DMA in the vicinity of the PstI site, 
i.e., strand separation or fraying. Since PstI will only cleave 
double stranded DNA, strand separation could inhibit subsequent PstI 
digestion. 



Example 14 

Ligation of Oligonucleotide Casettes into the Subtilisin Gene 

15 10 uM of four complementary oligonucleotide pools (A-D, Table 1 

below) which were not 5* phosphorylated were annealed in 20 M l 
ligase buffer by heating for five minutes at 68'C and then cooling 
for fifteen minutes at room temperature. 1 uM of each annealed 
oligonucleotide pool, ~0.2 »g Kpnl and PstI -digested P a222 obtained 
20 in Example 13, 0.5 mM ATP, ligase buffer and 6 units T 4 DNA ligase 
in 20 uL total volume was reacted overnight at 14'C to ligate the 
pooled cassettes in the vector. A large excess of cassettes (~300x 
over the pa222 ends) was used 1n the ligation to help prevent 
intramolecular KpnI-Kpnl ligation. The reaction was diluted by 
25 adding 25 U L of 10 mM Tris pH 8 containing 1 mM EDTA. The mixture 
was reannealed to avoid possible cassette concatemer formation by 
heating to 68°C for five minutes and cooling for 15 minutes at room 
temperature. The ligation mixtures from each pool were transformed 
separately into E. coH 294 rec + cells. A small aliquot from each 
30 transformation mixture was plated to determine the number of 
independent transformants. The large number of transformants 
indicated a high probability of multiple mutagenesis. The rest of 
the transformants ("200-400 transformants) were cultured in 4 mi of 
LB medium plus 12.5 u g chloramphenicol /ml . DNA was prepared from 
35 each transformation pool (A-0). This ONA was digested with Kpnl, 
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"0.1 y g was used to retransform £. col i rec + and the mixture was 
plated to isolate individual colonies from each pool. Ligation of 
the cassettes into the gene and bacterial repair upon transformation 
destroyed the Kpnl and PstI sites* Thus, only pa222 was cut when 
5 the transformant DNA was digested with Kpnl. The cut plasmid would 
not transform £. coli . Individual trans formants were grown in 
culture and DNA was prepared from 24 to 26 transformants per pool 
for direct plasmid sequencing. A synthetic oligonucleotide primer 
having the sequence 5 1 -GAGCTTGATGTCATGGC-3 1 was used to prime the 
10 dideoxy sequencing reaction. The mutants which were obtained are 
described in Table C below. 

Two codon+222 mutants (i.e., gin and He) were not found after 
the screening described. To obtain these a single 25ner 

15 oligonucleotide was synthesized for each mutant corresponding to the 
top oligonucl tide strand in Figure 12. Each was phosphorylated 
and annealed to the bottom strand of its respective 
nonphosphorylated oligonucleotide pool (i.e., pool A for gin and 
pool D for ile). This was Ugated Into Kpnl and PstI digested pa222 

20 and processed as described for the original oligonucleotide pools. 
The frequency of appearance for single mutants obtained this way was 
2/8 and 0/7 for gin and ile, respectively. To avoid this apparent 
bias the top strand was phosphorylated and annealed to its 
unphosphorylated complementary pool. The heterophosphorylated 

25 cassette was ligated into cut pa222 and processed as before. The 
frequency of appearance of gin and ile mutants was now 7/7 and 7/7, 
respectively. 

The data in Table C u.*nonstrate a bias in the frequency of 
30 mutants obtained from the pools. This probably resulted from 

unequal representation of oligonucleotides in the pool. This may 
have been caused by unequal coupling of the particular trimers over 
the mutagenesis codon in the pool. Such a bias problem could be 
remedied by appropriate adjustment of trimer levels during synthesis 
35 to reflect equal reaction. In any case, mutants which were not 
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isolated in the primary screen were obtained by synthesizing a 
single strand oligonucleotide representing the desired mutation, 
phosphorylating both ends, annealing to the pool of 
non-phosphorylated complementary strands and li gating into the 
cassette site. A biased heteroduplex repair observed for the 
completely unphosphorylated cassette may result from the fact that 
position 222 is closer to the 5' end of the upper strand than it is 
to the 5' end of the lower strand (see Figure 12). Because a gap 
exists at the unphosphorylated 5' ends and the mismatch bubble in 
the double stranded DNA is at position 222. excision repair of the 
top strand gap would more readily maintain a circularly tybridized 
duplex capable of replication. Consistent with this hypothesis is 
the fact that the top strand could be completely retained by 
selective 5' phosphorylation. In this case only the bottom strand 
contained a 5' gap which could promote excision repair. This method 
is useful in directing biased incorporation of synthetic 
oligonuclotide strands when employing mutagenic oligonucleotide 
cassettes. 

Example 15 

Site-Specific Mutagenesis of the Subtilisin Gene at Position 166 

The procedure of Examples 13-14 was followed in substantial 
detail, except that the mutagenesis primer differed (the 37 mer 
shown in Fig. 13 was used), the two restriction enzymes were SacI 
and Xmalll rather than PstI and Kpnl and the resulting constructions 
differed, as shown in Fig. 13. 

Bacillus strains excreting mutant subtil isins at position 166 
were obtained as described below in Example 16. The mutant 
subtilisins exhibiting substitutions of ala, asp, gin, phe, his, 
lys, asn, arg, and val for the wild- type residue were recovered. 
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Example 16 



Preparation of Mutant Subti 1 i sin Enzymes 



B. subtil is strain BG2036 obtained by the method of Example 11 
was transformed by the plasmids of Examples 14, 15 or 20 and by 
pS4-5 as a control. Transformants were plated or cultured in shaker 
flasks for 16 to 48 h at 37'C in LB media plus 12.5 ug/ml 
chloramphenicol. Mutant enzymatically active subtilisin was 
recovered by dialyzing cell broth against 0.01M sodium phosphate 
buffer, pH 6.2. The dialyzed broth was then titrated to pH 6.2 with 
IN HC1 and loaded on a 2.5 x 2 cm column of CM cellulose (CM-52 
Whatman). After washing with 0.01M sodium phosphate, pH 6.2, the 
subtil isins (except mutants at position +222) were eluted with the 
same buffer made 0.08N in NaCl . The mutant subtilisins at position 
+222 were each eluted with 0.1M sodium phosphate, pH 7.0. The 
purified mutant and wild type enzymes were then used in studies of 
oxidation stability, Km, Kcat, Kcat/Km ratio, pH optimum, and 
changes in substrate specificity. 
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Table C Oligonucleotide Pool Organization 
and Frequency of Mutants Obtained 



Pool Amino Acids Codon-222a Frequencyb 

A asp GAT 2/25 

met ATG 3/25 

cys TGT 13/25 

arg AGA 2/25 

gin GAA 0/25 

unexpected mutants* 5/25 

B leu CTT 1/25 

pro CCT 3/25 

phe TTC 6/25 

tyr TAC 5/25 

his CAC 1/25 

unpexpected mutants 9/25 

C glu GAA 3/17 

ala GCT 3/17 

thr ACA 1/17 

lys AAA 1/17 

asn AAC 1/17 

unexpected mutants 8/17 

D gly GGC 1/23 

trp TGG 8/23 

ile ATC 0/23 

ser AGC 1/23 

val GTT 4/23 

unexpected mutants 9/23 



Codons were chosen based on frequent use in the cloned 
subtil isin gene sequence (Wells et al_. , 1983, Id.). 

Frequency was determined from single track analysis by direct 
plasmid sequencing. 

Unexpected mutants generally comprised double mutants with 
changes In codons next to 222 or at the points of ligation. 
These were believed to result from impurities in the 
obigonucleotlde pools and/or erroneous repair of the gapped 
ends. 



0130756 



Example 17 

Mutant Subtilisin Exhibiting Improved Oxidation Stability 

Subtilisins having cysteine and alanine substituted at the 222 
position for wild-type methionine (Example 16) were assayed for 
resistance to oxidation by incubating with various commit? ati ens of 
sodium hypochlorite (Clorox Bleach). 

To a total volume of 400 pi of 0.1M, pH 7, NaP0 4 buffer 
containing the indicated bleach concentrations (Fig. 14) sufficient 
enzyme was added to give a final concentration of 0.016 mg/ml of 
enzyme. The solutions were incubated at 25*C for 10 min. and 
assayed for enzyme activity as follows: 120 pi of either ala+222 or 
wild type, or 100 pi of the cys+222 incubation mixture was combined 
with 890 pi 0.1M tris buffer at pH 8.6 and 10 pi of a sAAPFpN 
(Example 18) substrate solution (20 mg/ml In DMSO). The rate of 
increase in absorbance at 410 nm due to release of p-nitroaniline 
(Del Mar, E.G., et al_., 1979 "Anal. Blochem." 99, 316-320) was 
monitored. The results are shown in Fig. 14. The alanine 
substitution produced considerably more stable enzyme than either 
the wild-type enzyme or a mutant in which a labile cysteine residue 
was substituted for methionine. Surprisingly, the alanine 
substitution did not substantially interfere with enzyme activity 
against the assay substrate, yet conferred relative oxidation 
stability on the enzyme. The serine+222 mutant also exhibited 
improved oxidation stability. 

Example 18 

Mutant Subtilisins Exhibiting Modified Kinetics and Substrate 
Specificity 

Various mutants for glycine*166 were screened for modified 
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,«.<«c Kinetic parameters were obtained by 
Kcat, Km and Kcat/Km ratios. v 

* u Bnr6CC curves of the reactions. The rate of 
analysis of the progress curves v 

. ac 'function of substrate concentration. Data 
rection was measured as a Tunttiu< 

* i»rf hv fitting to the Michael is-MenUn equation using the 
was ana.yzed by fitting fM _ Mia ^t n. U. 196 



10 



15 



was ana yzea Dy nums >~ - - 

non-linear recession «* ^« ,Ha """ r "- "l 19 "' 

" S.c. .nd. Appl. Hath." U. «!-«>. All reactions were 

„ . j „ M-c in 0 1M tris buffer, pH 8.0, containing 
conducted at Z5 c in u.m u law-Mi- Veoa 

(depend^ on the value of Km for the en^e of , te est 
concentrations were adjusted in each measurement so as to exceed 
or succinyl-L-AHnyl-L-AUnyl-L-Pro^l-L-Phenylalanyl-P-n tro- 

,Ui Vena Biochemical at initial concentrations of 
0.0010 M to 0.00028 M (varytn, « scribed for 

T ne results obtained in these experiments -ere as follows: 

Table D 

r ^ Kcat(s-l) Km (M) _Kcat/Km_ 
20 Substrate Enzyme Kcat_is — >_ 

sAAPFpN gly-166(wild type) 
ala+166 
asp+166 

25 glu+166 

phe+166 
hys+166 
lys+166 
asn+166 

30 arg+166 

val+166 

BYGRpN Mild T yP e 

asp+166 

glu+166 

35 asn+166 



37 


1.4xl0" 4 


3 x 10 5 


19 


2.7xl0" 5 


7 x 10 5 


3 


5.8xl0 -4 


5 x 10 3 


11 

3 


3.4xl0~ 4 
1.4xl0' 5 


3 x 10 4 
2 x 10 5 


15 


l.lxlO" 4 


1 x 10 5 


15 


3.4xl0" 5 


4 x 10 5 


26 


1.4xl0" 4 


2 x 10 5 


19 


6.2x*G~ 5 


3 x 10 5 


i 


1.4xl0" 4 


1 x 10 4 


2 


l.lxlO" 3 


2 x 10 3 


2 


4.1xl0" 5 


5 x 10 4 


2 


2.7xl0" 5 


7 x 10 4 


1 


1.2xl0" 4 


8X 10 3 
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The Kcat/Km ratio for each of the mutants varied from that of 
the wild-type enzyme. As a measure of catalytic efficiency, these 
ratios demonstrate that enzymes having much higher activity against 
a given substrate can be readily designed and selected by screening 
5 in accordance with the invention herein. For example, A166 exhibits 
over 2 times the activity of the wild type on sAAPFpN. 

This data also demonstrates changes in substrate specificity 
upon mutation of the wild type enzyme. For example, the Kcat/Km 
10 ratio for the D166 and E166 mutants is higher than the wild type 
enzyme with the BVGpN substrate, but qualitatively opposite results 
were obtained upon Incubation with sAAPFpN. Accordingly, the D166 
and E166 mutants were relatively more specific for BVGRpfJ than for 
sAAPFpN . 

15 

Example 19 

Mutant Subtil isin Exhibiting Modified pH-Activity Profile 

20 The pH profile of the Cys+222 mutant obtained in Example 16 was 
compared to that of the wild type enzyme. 10 u l of 60 mg/ml sAAPFpN 
in DMS0, 10 ul of Cys+222 (0.18 mg/ml) or wild type (0.5 mg/ml ) and 
980 ul of buffer (for measurements at pH 6.6, 7.0 and 7.6, 0.1M 
NaP0 4 buffer; at pH 8.2, 8.6 and 9.2, 0.1M tris buffer; and at pH 

25 9.6 and 10.0, 0.1M glycine buffer), after which the initial rate of 
change in absorbance at 410 nm per minute was measured at each pH 
and the data plotted in Fig. 15. The Cys+222 mutant exhibits a 
sharper pH optimum than the wild type enzyme. 

30 Example 20 

Site-Specific Mutagenesis of the Subtil isin Gene at Position 169 

The procedure of Examples 13-14 was followed in substantial 
35 detail, except that the mutagenesis primer differed (the primer 
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shown in Fig. 16 was used), the two restriction enzymes were Kpnl 
and EcoRV rather than PstI and Kpnl and the resulting constructions 
differed, as shown in Fig. 16. 

Bacillus strains excreting mutant subtilisins at position 169 
were obtained as described below in Example 16. The mutant 
subtilisins exhibiting substitutions of ala and ser for the 
wild-type residue were recovered and assayed for changes in kinetic 
features. The assay employed SAAPFpN at pH 8.6 in the same fashion 
as set forth in Example 18. The results were as follows: 

Table E 

Enzyme_ Kcat (s' 1 ) Km (M) Kcat/Km 

ala+169 58 7.5 x 10" 5 8 x 10 5 

ser+169 38 8.5 x 10" 5 4 x 10 5 



Example 21 

Alterations In S pecific Activity on a Protein Substrate 

Position 166 mutants from Examples 15 and 16 were assayed for 
alteration of specific activity on a naturally occuring protein 
substrate. Because these mutant proteases could display altered 
specificity as well as altered specific activity, the substrate 
should contain sufficient different cleavage sites i.e., acidic, 
basic, neutral, and hydrophobic, sc *s not to bias the assay toward 
a protease with one type of specificity. The substrate should also 
contain no derlvltized residues that result In the masking of 
certain cleavage sites. The widely used substrates such as 
hemoglobin, azocollogen, azccaseln, dimethyl casein, etc., were 
rejected on this basis. Bovine casein, a and a 2 chains, was 
chosen as a suitable substrate. 
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A 1 percent casein (w/v) solution was prepared in a 100 mM Tris 
buffer. pH 8.0, 10 nti EDTA. The assay protocol is as follows: 

790 »jl 50 raM Tris pH 8.2 
5 100 nl 1 percent casein (Sigma) solution 

10 nl test enzyme (10-200 ugh 

This assay mixture was mixed and allowed to incubate at room 
temperature for 20 minutes. The reaction was terminated upon the 

10 addition of 100 M l 100 percent trichloroacetic acid, followed by 
incubation for 15 minutes at room temperature. The precipitated 
protein was pelleted by centr fugation and the optical density of 
the supernatant was determined spectrophotometrically at 280 nm. 
The optical density is a reflection of the amount of unprecipitated, 

15 i.e., hydrolyzed. casein in the reaction mixture. The amount of 
casein hydrolysed by each mutant protease was compared to a series 
of standards containing various amounts of the wild type protease, 
and the activity is expressed as a percentage of the corresponding 
wild type activity. Enzyme activities were converted to specific 

20 activity by dividing the casein hydrolysis activity by the 280 nm 
absorbance of the enzyme solution used In the assay. 

All of the mutants which were assayed showed less specific 
activity on casein than the wild type with the exception of Asn+166 
25 which was 26 percent more active on casein than the wild type. The 
mutant showing the least specific activity was ile+166 at 0.184 of 
the wild type activity. 



30 



35 



0992Y 



-53- 



0130756 



CLAIMS 

1. A method of preparing a procaryotic carbonyl hydrolase 
which comprises: culturing a recombinant host cell 

5 transformed with an expression vector comprising the DNA 
sequence encoding the hydrolase, and recovering the 
hydrolase from the cell culture. 

2. The method of claim 1 wherein the hydrolase is a 

10 protease, preferably a subtilisin or a roetalloprotease, most 
preferably a subtilisin, the DNA sequence preferably 
encoding subtilisin in the form of prosubtilisin or 
preprosubtilisin . 

15 3. The method of claim 1 wherein the recombinant host 
cell is a strain of Bacillus , preferably a Bacillus 
subtilis . 

4. The method of claim 2 wherein the DNA sequence 
20 encoding subtilisin is operably linked to its native 
promoter, to a promoter homologous to the host which is 
other than the native promoter, or to a promoter which is 
heterologous to the host. 

25 5. The process of claim 2 wherein the recombinant host 
cell was transformed with an effective expression vector for 
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the DNA sequence encoding the procaryotic protease operably 
linked to its signal sequence. 

6. A composition comprising a procaryotic carbonyl 
5 hydrolase, preferably a Bacillus hydrolase, and a host 

microorganism transformed so as to be capable of expressing 
the hydrolase, 

7. A composition comprising prepro-, pre- or procarbonyl 
10 hydrolase, preferably prosubtilisin, essentially free of 

cells which express said prepro-, pre- or procarbonyl 
hydrolase. 

8. A liquid detergent composition comprising B. 
15 amyloliquef aciens subtilisin. 

9. An expression vector for a procaryotic carbonyl 
hydrolase which comprises a DNA sequence encoding the 
hydrolase operably linked to a promoter compatible with a 

20 suitable host cell, 

10. A recombinant expression vector comprising a DNA 
sequence encoding a prepro- or procarbonyl hydrolase, 
preferably subtilisin, operably linked to a promoter 

25 compatible with a suitable host cell; or a cell, preferably 
a strain of Bacillus , transformed by a said vector. 
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11. A method comprising: 

(a) isolating a DNA moiety encoding a procaryotic 
carbonyl hydrolase; 

(b) introducing a mutation into a predetermined region 
in the DNA which, upon expression of the DNA, results in the 
substitution, deletion or insertion of at least one amino 
acid at a predetermined site in the hydrolase; 

and optionally 

(c) transforming a suitable host with the mutated DNA 
of step (b) and recovering the expression product of the 
mutated DNA. 

12. The method of claim 11 wherein the mutation is 
predetermined; preferably expressed as the substitution or 
insertion of a single amino acid. 

13. The method of claim 11 wherein the DNA was isolated as 
a fragment of genomic DNA from an organism expressing the 
carbonyl hydrolase; the fragment preferably consisting 
essentially of the structural gene for the hydrolase. 

14. The method of claim 11 wherein the hydrolase is a 
protein hydrolase or lipase, preferably Bacillus subtilisin, 
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prosubtilisin, preprosubtilisin, a metal loprotease or other 
peptide hydrolase. 

15. The method of claim 14 wherein the hydrolase is 

5 Bacillus subtilisin and the mutation is introduced into the 
subtilisin at aspartate+32, asparagine+155 , tyrosine+104 , 
methionine+222, glycine+166, histidine+64 , serine+221, 
glycine+169, glutamate+156 , serine+33, phenylalanine+189 , 
tyrosine+217 and/or alanine+152. 

10 

16. The method of claim 14 wherein the hydrolase is 
prosubtilisin or preprosubtilisin and the mutation is 
introduced into the presubt ilisin or preprosubtilisin at 
tyrosine-1. 

15 

17. The method of claim 11 wherein the mutation is 
expressed as a mutant carbonyl hydrolase exhibiting a change 
in one or more of the oxidation stability, Km, Kcat, Kcat/Km 
ratio, substrate specificity, specific activity or pH 

20 optimum of the hydrolase. 

18. The method of claim 14 wherein the hydrolase is a 
peptide hydrolase selected from o-aminoacy lpeptide 
hydrolase, pept idylamino-acid hydrolase, acylamino 

25 hydrolase, serine carboxypept idase, metallocarboxypeptidase, 
thiol proteinase, carboxylproteinase or metalloproteinase. 
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19. DNA encoding a predetermined mutant of a procaryotic 
carbonyl hydrolase; or a vector capable of transforming a 
host cell to produce a mutant bacterial carbonyl hydrolase, 
which vector comprises such DNA and which, upon 

5 transformation of the host cell, results in expression of 
the mutant hydrolase; or a host cell transformed with a said 
vector. 

20. A mutant carbonyl hydrolase of the kind obtained by 
10 the method of claim 11. 

21. A composition comprising the hydrolase of claim 20 in 
combination with a detergent, preferably a liquid detergent 
or a detergent in granular form; optionally additionally 

15 comprising a builder, bleach or fluorescent whitening agent. 

22. The composition of claim 11 wherein the detergent is a 
linear alkyl benzene sulfonate, alkyl ethoxylated sulfate, 
sulfated linear alcohol or ethyoxylated linear alcohol. 

20 

23. The method of claim 17 wherein a mutant exhibiting a 
different substrate specificity is recovered. 

24. The method of claim 11 wherein the mutation is within 
25 the enzyme active site. 
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25. The method of claim 11 wherein the DNA is expressed in 
a bacterial host cell, preferably a Bacillus species. 

26. The method of claim 11 wherein the mutation is 

5 expressed as the substitution of one or more methionine, 
tryptophan, cysLelue or lysine residues by * aubstituent 
amino acid residue not one of methionine, tryptophan, 
cysteine or lysine, preferably alanine or serine. 

10 27. The method of claim 17 wherein the mutation renders 
the mutant enzyme either less oxidation stable or more 
oxidation stable than the precursor enzyme. 

28. DNA encoding a predetermined enzymatically active 

15 mutant of a precursor procaryotic carbonyl hydrolase; said 
mutant exhibiting a different substrate specificity, 
oxidation stability and/or pH-activity profile than the 
precursor enzyme; or a vector capable of transforming a host 
cell to produce a mutant enzyme, which vector comprises such 

20 DNA and which, upon transformation of the host cell, results 
in expression of the mutant enzyme; or a host cell 
transformed with such a vector. 

29. A method comprising culturing a host cell of claim 28 
25 and recovering therefrom the mutant enzyme. 
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30. A substantially normally sporulating Bacillus which is 
incapable of excreting subtilisin or neutral protease. 

31. The Bacillus of claim 30, preferably free of Bacillus 
5 strains capable of excreting subtilisin or neutral protease, 

and comprising a mutant neutral protease gene, preferably 
nonreversible, which gene contains a deletion that results 
in no expression or, upon expression, an enzymatically 
inactive polypeptide. 

10 

32. The Bacillus of claim 30 which is (a) incapable of 
excreting neutral protease and (b) transformed with at least 
one DNA moiety encoding a polypeptide not otherwise 
expressed by the Bacillus , preferably a subtilisin mutant or 

15 a eucaryotic protein. 

33. A vegetative-phase Bacillus culture which is 
essentially free of neutral protease, or which is 
essentially free of subtilisin. 

20 

34. A Bacillus culture free of any gene capable of 
expressing enzymatically active neutral protease, or free of 
any gene capable of excreting enzymatically active 
subtilisin. 

25 

35. A vector comprising a deletion mutated neutral 
protease gene or a deletion mutated subtilisin gene. 
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36. A method comprising culturing the Bacillus of claim 30 
until a protein not subtilisin or neutral protease, 
preferably amylase, has accumulated in the culture, and 
recovering the protein. 

5 

37. A method comprising: 

(a) obtaining a DNA moiety encoding at least a portion 
of said precursor protein; 

10 

(b) identifying a region within the moiety: 



(c) substituting nucleotides for those already 
existing within the region in order to create at least one 
15 restriction enzyme site unique to the moiety, whereby unique 
restriction sites 5* and 3' to the identified region are 
made available such that neither alters the amino acids 
coded for by the region as expressed; 



20 (d) synthesizing a plurality of oligonucleotides, the 

5' and 3' ends of which each contain sequences capable cf 
annealing to the restriction enzytnc sites introduced in step 
(c) and which, when ligated to the moiety, are expressed as 
substitutions, deletions and/or insertions of at least one 

25 amino acid in or into said precursor protein; 
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(e) digesting the moiety of step (c) with restriction 
enzymes capable of cleaving the unique sites; 

(f) ligating each of the oligonucleotides of step (d) 
5 into the digested moiety of step (e) whereby a plurality of 

mutant DNA moieties are obtained; 

and optionally the further steps of 

10 (g) expressing each of said moieties as a mutant 

protein in a suitable host; 

(h) recovering the mutant proteins of step (g); and 

15 (i) screening the step (h) mutant proteins for the 

desirable characteristic. 

38. The method of claim 37 wherein the restriction enzyme 
sites are different. 

20 

39. The method of claim 37 wherein the oligonucleotides 
are less than about 50 bp. 

25 
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EcoR\ CIq\ fVw!l BomH\ 

P RPS^PRE^-PRO MAT TERM 

h* — l 5*b M 



B. 



r*8S 

1 ^.TrTarTflflAATATlATTrrATArTATACA^TTAATACACARAJnAATC^^ fiT(; 

,pr PRE' -on 

Arc GW Lw l« Trp !1# S«r Leu p »* *1* *1* LP" Hp Phe Tnr »*t Ala Ph? Glv W Thr Scr 

-oo ftf.A art aaa iAA ma » y. ,-. T c art tta ctr ttt km tta rcg tta atc ttt Arc atr GrR ttc GRC arc aca tcc 

- _rp .* PRO -6* 

- <«»r Al* Gin AU AU R1 v Lv < A*n GN Alii I vt tv* Tv He Val GW P*w IvS Al" Thr *pt <*r Thr MM 

pa t:t gcc cag gcg &ca gag aaa tca aac gar gaa aag aaa tat att gtc ggr ttt aaa rAR aca atg arc acr atg 

W Ala Ala lv* Lrt Irt Asr Val Hp W R1u trt Rl v Rlv ivi Val Rio ivt Gin Php Lrt Tvr Val A S p Ala 
iflO ARf r.rf RTT AA f . fiAf- A A/. Rft* RTf. ATT TfT AAA AAA RAT AAA AAA RTR fAA AAR fAA TTC AAA TAT RTA RAC GfA 

AU W Ala >r Ip-» »\n Rl-i lv* A1« V*' ivt Rlu lv» lv* Asp Prn W Val *la Tvr Val Rlu Rlu Asp 
J^A RCT TCA GC T ACA TTA AAC GAA AAA GCT GTA AAA AAA TTG AAA AAA C»AC CCG ARC GTC GCT TAC RTT AAA-AAA RAT 

-i I i * WAT _ j A 

Mii Val Ala hu it# Tvr Ala G'n Vr Val e ro Tyr Rlv Vat W Rln He 1 Ala Pro Ala Lp«< «>i W Rln 
lOO rflf RTA RfA TAT Rfr, Ttr r/r, far. TCC RTR C"l TAC RAT RTfl Tf* CAA ATT AAA RCC CCT RTT CTR r AC KT CAA 

?n r» an 

Rlv T vr Thr r,U V t<« V*l lv\ V*l •! * IHl t l# W U v 1 1* A^p W W Pro Hn I tv^ Val 

A?t GRf .TAC ACT V.A TCA AA T RTT AAA f.*A R/R- RT T. A T C RAT ART RRI ATf RAT TCT TCT CAT CfT RAT TTA AAR RTA 

Sfi (Vo *\m *> n A<n 

AU Rlv RW Ala ^ *Vt-Va* Prn Vr Rlu T*»r As« Prft' Ph#» Rln Asn Ami A^n His G'v Thr His Val Al a 
inQ rrt r.Rf RR5 R r f art r.»» rrT »r* Kaa a'a aat rr: ttc tab rat AAr aac tct cac RRA AfT rar rtt rtc 

7n or Vr Ala 00 

Rlv T hr Val Al.i *1 » *\" W 1 R»v I^m Rl» VaI Pr^ ^»r ft I a *+r Lpu Tvr Al* Val lv« 

6?A RGC ACA f.TT RfG RCT r 7 T 41* AAr Tf* AT? RR* RTA TIA R/X RTT RCf. CCA ARf RCA TCA f TT TAC RCT RTA AAA 

A*n Ala 1^ 

Val Gtv Ala Asp RU V»" R'» G 1 " 1 «r V<- Trf !1p I 1» A^« Rl v !> Rl., T rp *U J lp A1a A^n Asn Hpt 

foo r,jT rrr rrt rtt R«r rrt ir** rrt taa TAr arc t w. atc att «ac rra atc c/ir, trt. r.rR air R'a aac aat atr 
1 to I** M° 

A*p V^l Hp As*» W IPii Rlv tlf D rr Vr Rl» W Ala AU I »»i Al* Al « Val A^o lv* Ala V*l Ata 

774 GAC GTT ATT AAC ATR ARC CTC Rf.C RRA CCT TfT RAT TCT Rf T f,f» m AAA fXR RTT RAT AAA RCC RTT Rf A 

]V W Thr l*/> 

^er Rlv Val Val Val Val Ala AU Ala RU A\« Rlu Rl* TKr Kmr Rlv W W <ii»r Thr Val Rlv Tvr Prn Rlv 
Qfl= Tff RRC RTC RTA RT r RTT RTC Va RTf (XT AAf RAA W ACT TT RRT ARC TfA ftRC ACA RTR RRf TAC CCT RRT 

17n ion 

Lv* 'vr Prp W V*1 He Ala V*l Rlv Al« V*l A\f» W A* n Rln Arn Ala W PhP S»r W Val Rl v Pm 

Q?4 AAA TAC CCT TCT GTC ATT RCA RTA .RAT ATT f.TT AAf AW ARC AAC CAA AAA RCA TCT TTC TCA AREATA RRA C r T 

6l« A^p val Ala Prr» Rlv ul.Vf'H* Al« W T^r l»u-Pffi Civ A^n Lv^ Tvr RW Ala Tvr Asn Glv 
qoq fiAf, CTT RAT RTC ATR Rf A ffT RAT r.TA TCT AU CAA AAT A^R ^ T T CCT RRA AAr AAA TAC AAA RTR TAC AAf RRT 

?7r\ ' . ■ . >yn <4rt 

Thr >r **t Ala W »rtv hh Val *laftU Al« Ala A* a t#« ! 1# l*n Vr I M»s Prn Asn Trp Thr Atn Thr 

- 107A ACG TCA ATG CCA TCT CC6 CAC RTT RfC AAA RCA 6/T :/T TTA ATT CTT TCT AAR CAC CCG AAC TRR ACA AAr ACT 

?y> ri* -»v> 

Gin VAl Arq Ser R1.i A%n ftir T*r Thr in 6W A\o W Mtf Tyr Tyr Glv Lv^ Glv If" 1 li» Asn 
U4Q CAA RTC CAC ART ART T TA AAA AAf Aff AfT Aft AAA f T T ART AAT TCT TTC TAC TAT RRA AAA RGR CTR ATC AAC 

Val Rln Ata Ala Ai« Rln or i cnifi 

1'34 RTA CAR RCA ACA ATT r*R TAA AAf 1 TAAAAAAC fAAf M TAAt'f tt Af f ART T T T T TAT T A T TT T T CTTCCTrCAf.ATRTTCAATf f. AC TCf 
131ft AT AATCGACGRATRRC I C C C T C T GA AAA T T T TAA^RAAAAACAACAAAT rtACCCAATTf AATCCATAACAAT f AARTCCTAAAArRTrTCAATCRCCA 



IfllA C T T f rf RA T T T t f AA T r »Af Tf A A fj;' t UC AATT ART AAf At T T Tf f TAA T Af f AAA A r . Af ART ATT^ATAAT f AAA TC 
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Fig. 2. 

Panel A: probe I [ 32 P-AaCaa£aTGGA£gt] 




Panel B: probe 2 [ 32 P-AaTaa£aTGGA$Gt] 
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Fig.U 
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Fig.7. 



1 GATATACCTAAATAGAGATAAAATCATCTCAAAflAfttT&GGTCTACTAAAATATIATTCrATCTATTAC AATAAATTCACAG* ATAGTCTTTTAAGTAAG 

-100 

y *et Ar Q Ser i.*s Lys Leu Trp He Ser Leu Leu Phe Ala leu Thr Leu 
101 TCTACTCTGAATTTTTTTAAAAGGAGAGoGTAAAGA G?G AGA AGC AAA AAA 'TG TGG A TC AGC TTG TTG TTT GCG TTA ACG TTA 

-90 -80 -70 

lie Phe The Met A!a Phe Ser Asn w e t Ser Ala Gin Ala AVa Gl* Lvs S»r w Th- gi-j lys Lys T*r Me V«l 
185 ATC TTT ACG ATG GCG T TC AGC AAC ATG I C T GCG C AG GC t GCC GGA AAA AGC A&T AC A GAA AAG AAA TAC ATT GTC 

-60 -50 
Gly Phe Lys Gin Thr Met Ser Ala Met Ser Ser AW Lys Lys Lys Asp Val Me Ser Glu Lvs Gly Glv Ly, Val 
?60 GGA TTT AAA CAG AC* ATG AG? GCf ATG AGT TCC GCC AAG AAA AAG GAT GTT ATT TC T GAA AAA GGC GGA AAG GTT 



-40 -30 -20 

Gin Lys Gin Phe Lys r yr Val As* Ala Ala Ala Ala Thr Leu A sc > Glu Lvs Ala Val Lys Glu Leu Lys Lys Asd 

335 CAA AAG C AA TTT AAG TAT GTT AAC GCG GCC GC A GCA AC* TTG GAT GAA AAA GCT GT A AAA GAA TTG AAA AAA GAT 

-10 -11 10 

Pro Ser Val Ala Tyr val Glu Glu Asp His Me Ala **U Glu Tyr Ala Gin Ser Val Pro Tyr Gly He Ser Gin 

410 CCG AGC GTT GCA TAT GTG GAA GAA GAT CAT ATT GCA CAT GAA TAT GCG CAA TCT GTT CCT TAT GGC ATT TC T CAA 

?0 30 32 

lie Lys Ala Pro Ala Leu h-i $*r Gin Gly Tvr Thr Gly Ser A S n Val Lys Val Ala Val He Asp Ser Gly He 

485 ATT AAA GCG CCG GCT C TT CAC T C T CAA GGC TAC AC A GGC TCT &AC GTA AAA GTA GCT GTT ATC GAC AGC GGA ATT 

40 SO 60 

Asp Ser Ser His Pro Asp Leu Asn Val Arg Gly Gly Ai* Ser Phe Val Pro Ser Glu Thr Asn Pro Tyr Gin Asp 

560 GAC TCT TCT CAT CCT GAC TTA AAC GTC AGA GGC GGA GCA AGC TIC GTA CCT TCT GAA ACA AAC CCA TAC CAG GAC 

64 70 80 

Gly Ser Ser His Gly Thr Mis Val Ala Gly Thr Me Ala Ala Leu Asn Asn Ser lie 61y Val Leu Gly Val Ser 

635 GGC AGT TCT CAC GGT ACG CAT GTA GCC GGT ACG ATT GCC GCT CTT AAT AAC TC A ATC GGT GTT CTG GGC GTT AGC 

90 too no 

Pro ler Ala Ser Leu Tyr Ala Val l>* val L*u Asp Ser Thr Gly Ser Gly Gin Tyr Ser Trp He He Asn Gly 

710 CCA AGC GCA TC A TTA TAT GCA GTA AAA GTG CTT &AT TC A ACA GGA AGC GGC CAA TAT AGC T6G ATT ATT AAC GGC 

WO 130 

He Glu Trp Ala He Ser Asn Asn Met Asp Val tie Asn Met Ser Leu Gly Gly Pro Thr Gly Ser Thr Ala Leu 

785 ATT GAG TGG GCC ATT TCC AAC AAT ATG GAT GTT ATC AAC ATG AGC CTT GGC GGA CCT ACT GGT TCT ACA GCG CTG 

140 ISO 160 

Lys Thr Val Val Asp Lys Ala Val Ser Ser Gly He Val Val Ala Ala Ala Ala Gly Asn Glu Gly Ser Ser Gly 

860 AAA ACA GTC GTT GAC AAA GCC GTT TCC AGC GGT ATC GTC GTT GCT GCC GCA GCC GGA AAC GAA GGT TCA TCC GGA 

170 180 

Ser Thr Ser Thr Val Gly Tyr Pro Ala Lys Tyr Pro Ser Thr He Ala Val Gly Ala Val Asn Ser Ser Asn Gin 

935 AGC ACA AGC ACA GTC GGC TAC CCT GCA AAA TAT CCT TCT ACT ATT GCA GTA GGT GCG GTA AAC AGC AGC AAC CAA 

190 ?00 210 

Arg Ala Ser Phe Ser Ser Ala Gly Ser Glu leu Asp Val Met Ala Pro Gly Val Ser He Gin Ser Thr Leu Pro 

1010 AGA GCT TCA TTC TCC AGC GCA JGT TCT GAG CTT GAT GTG ATG GCT CCT GGC GTG TCC ATC CAA AGC ACA CTT CCT 

22^ 221 230 

Gly Gly Thr Tyr Gly Ala Tyr Asn Gly Thr Ser let Ala Thr Pro His Val Ala Gly Ala Ala Ala Leu lie Leu 

1085 GGA GGC ACT TAC GGC GCT TAT AAC GGA ACG TCC AT6 6CG ACT CCT CAC GTT GCC GGA GCA GCA GCG TTA ATT CTT 



240 250 260 

Ser Lys His Pro Thr Trp Thr Asn Ala 61" Val Are Asp Arg Leu Glu ier Thr Ala Thr Tyr Leu 61y Asn Ser 
1160 TCT AAG CAC CCG ACT TGG ACA AAC GCG CAA GTC CGT GAT CGT TTA GAA AGC ACT GCA ACA TAT CTT GGA AAC TCT 



270 

Phe Tyr Tyr Gly Lys Gly Leu He Asn Val Gin Ala Ala Ala Gin OC 
1235 TTC TAC TAT GGA AAA &GG TTA ATC AAC GTA CAA GCA GCT GCA CAA TAA T A6TAAAAAGAA6CA66TTCCTCCATACC TGCTT C 



1318 TTTTTATTTGTCAGCATCCTGATGTTCC6GCGCATTCTCTTCTTTCTCCCCATGTT6AATCCGTTCCATGATCGAC6GATGGCTGCCTCTGAAAATCTTC 
1418 ACAA6CACCGGAGGATCAACCT6CTCA6CCCC6TCACGGCCAAATCCTGAAAC6TTTTAACACTG6CTTCTCTGTTCTCTGTC 
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